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CHAPTER  I 

INTRODUCTION 

United  States  Air  Force  (USAF)  acquisition  and  logistics 
efforts  in  support  of  aircraft  turbine  engines  constitute  a major 
management  and  fiscal  program.  For  example,  the  active  inventory 
of  USAF  aircraft  turbine  engines  is  approximately  38,000  which 
represent  a total  investment  of  some  $10  billion  in  1975  dollars. 
Additionally,  nearly  $500  million  are  spent  annually  to  support  and 
maintain  this  active  inventory  of  jet  engines  (24:1). 

During  the  past  decade  the  annual  cost  to  operate  and  support 
(O&S)  USAF  systems  (including  aircraft  jet  engines)  has  increased 
severalfold  (24:1).  An  unquantifiable  portion  of  the  rise  in  O&S  costs 
has  been  attributed  to  the  lack  of  emphasis  placed  on  downstream 
operating  costs  during  the  early  conceptual  and  development  phases 
of  a jet  engine's  life  cycle  (2:1). 

The  USAF  has  developed  a program  for  providing  cost  esti- 
mates. Air  Force  Regulation  173-1  delineates  major  command 
responsibilities  to  accomplish  the  task  of  cost  estimating  for  pro- 
posed and  developing  systems.  In  accordanc*'!  with  this  directive, 
USAF  cost  estimates  for  aircraft  jet  engines  are  developed  by  the 


1 


major  command  responsible  for  a particular  phase  of  an  engine's  life 
cycle  {22;3).  The  Headquarters,  Air  Force  Logistics  Command 
(AFLC)  is  responsible  for  providing  preliminary  cost  estimates  on 
proposed  and  developing  jet  engines  which  are  modifications  of  exist- 
ing  jet  engines.  The  AFLC  is  responsible  for  estimating  those  costs 
in  support  of  jet  engines  which  will  be  funded  by  the  AFLC  once  full- 
scale  development  is  complete.  The  primary  area  of  concern  to  the 
AFLC  is  O&tS  costs  which  encompasses  investment,  maintenance,  and 
support  costs,  including  depot  level  overhaul  cost.  Within  the  AFLC, 
the  Independent  Cost  Analysis  Division  (ICAD)  is  the  responsible 
agency  for  developing  required  cost  estimates  (23). 

Headquarters,  Air  Force  Systems  Command  (AFSC),  on  the 
other  hand,  is  responsible  for  providing  preliminary  cost  estimates 

I 

I for  jet  engines  which  must  undergo  a research  or  development  effort. 

I Within  the  AFSC,  the  Air  Force  Aero  Propulsion  Laboratory  (AFAPL) 

I IS  the  focal  point  for  jet  engine  development  (24:29-35)  and,  as  such, 

is  responsible  for  coordinating  life  cycle  costs  (LCC)  estimates 

...  during  system  design,  concept  evolution  and 
selection,  [and]  fullscale  development  ...  to  ensure 
[that]  appropriate  trade-offs  [are  made]  among  invest- 

; ment  costs,  ownership  costs,  ...  and  performance  [7:5]. 

P 
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PROBLEM  STATEMENT 

One  area  for  USAF  jet  engine  O&S  cost  control  is  depot  level 
overhaul  cost  estimates  for  proposed  USAF  jet  engines  (23:i:  12). 
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The  U.S.  budget  allocation  for  defense  spending  has  become 
more  austere  each  year.  Between  fiscal  year  1964  and  fiscal  year 
1976,  the  allotment  for  defense  has  been  reduced  from  43  percent  to 
25  percent  of  the  Federal  budget  and  actual  purchasing  power  is  less 
now  than  12  years  ago  (6:10-76). 

While  U.S.  defense  spending  has  been  decreasing,  the  cost 
of  operating  and  supporting  weapon  systems  within  the  Department  of 
Defense  (DOD)  has  been  increasing  (12:1).  This  situation  presents  a 
significant  problem  for  the  DOD  that  has  received  considerable  atten- 
tion in  recent  years  with  respect  to  finding  a solution. 

In  February,  1976,  Deputy  Secretary  of  Defense  W.  P. 
Clements,  Jr.  released  a memorandum  to  the  Secretaries  of  the 
Military  Departments  wherein  he  said, 

I am  seriously  concerned  with  the  continuing  growth 
of  the  fraction  of  the  total  DOD  resources  needed  to  oper- 
ate and  support  our  weapons  and  the  decline  in  funds  for 
new  weapon  procurement  . . . [We  must]  increase  empha- 
sis on  controlling  the  outyear  operating  and  support  costs 
of  weapon  systems  during  the  development  and  acquisition 
phase  both  through  attention  to  design,  procurement,  and 
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support  planning  ...  I am  equally  concerned  that  insuf- 
ficient attention  is  being  paid  to  controlling  eventual 
system  OScS  [operating  and  support]  costs  during  con- 
ceptual, validation  and  full-scale  development  phases  of 
new  systems.  My  objective  is  to  achieve  an  overall 
reduction  in  the  fraction  of  each  Service's  outyear  budget 
allocated  to  O&S  costs  of  the  new  systems  we  are  develop- 
ing . . . From  this  time,  each  DSAHC  [Defense  System 
Acquisition  Review  Council]  review  is  to  specifically 
address  the  O&S  cost  impact  of  new  systems  compared 
to  those  to  be  replaced  or  augmented  . . . [21:A1-Z]. 

The  DOD,  concerned  with  reducing  the  fraction  of  each  Serv- 
ice's outyear  budget  allocated  to  O&cS  costs  of  new  weapons,  is  utiliz- 
ing the  Design-to-Cost  (DTC)  program  and  the  requirement  to  develop 
life  cycle  costs  (LCC)  estimates  for  developing  weapon  systems  to: 

(a)  achieve  the  lowest  possible  unit  cost,  and  (b)  to  provide  DOD  | 

decision  makers  with  a trade-off  alternative  based  on  the  lowest  esti- 
mated LCC  (26;  27). 

The  DOD  has  stipulated  that  costs  of  acquisition  and  owner-  < 

ship  are  to  be  established  as  separate  cost  elements  and  translated 
into  firm  DTC  and  LCC  requirements  for  a system  selected  for  full- 

1 

scale  engineering  development.  Weapon  system  program  actions  are 
to  be  evaluated  against  these  requirements  with  the  same  rigor  as  the 
evaluation  of  system  technical  specifications  and  requirements  (28:9). 

Design-to-Cost  requires  the  translation  of  acquisition  and  ownership 
(development,  production,  operation  and  support)  costs  into  "design- 
to"  requirements  and  the  utilization  of  these  requirements  to  evaluate 
system  development.  It  also  requires  that  trade-offs  be  made  among 
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performance,  development  schedule  and  coats  if  the  end  result  is  to 
be  production  and  OScS  cost  reduction  (20:84).  Life  cycle  cost  includes 
aill  costs  from  the  conceptual  phase  of  a system's  acquisition  cycle 
through  validation,  full-scale  development,  production,  deployment, 
and  the  revitalization  and  disposition  phases  (30). 

To  reduce  the  O&S  costs,  efforts  are  being  expended  to 
design  in  improved  reliability  and  maintainability,  and  to  use  lessons 
learned  from  existing  field  systems  in  the  design  of  new  systems. 

Cost  estimates  are  being  used  for  source  selection,  contract  negoti- 
ations and  the  establishment  of  contract  incentives.  Cost  estimates 
are  also  used  to  provide  O&cS  cost  predictions  early  in  the  acquisition 
cycle  so  that  considerations  of  design  alternatives  to  maintenance  and 
logistics  concepts  will  be  available  to  provide  significant  cost  reduc- 
tions (19:11)« 

COST  ESTIMATING 

A memorandum  in  December,  1971  from  the  then  Secretary 
of  Defense  Packard,  advised  all  Service  Secretaries  that  parametric 
cost  estimates  and  analyses  for  each  weapon  system  in  the  acquisition 
cycle  were  to  be  incorporated  in  each  Defense  System  Acquisition 
Review  Council  (DSARC)  presentation.  This  memorandum  was  fol- 
lowed in  January,  1972  by  a memorandum  from  the  then  Secretary  of 
Defense  Laird  which  reaffirmed  the  importance  of  cost  estimates  in 
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the  acquisition  of  weapon  systems  and  established  a Cost  Analysis 
Improvement  Group  (CAIG)  within  the  Office  of  the  Secretary  of 
Defense  (OSD).  The  responsibility  of  the  CAIG  is  to  review  cost 
estimates  presented  by  Services  to  the  DSARC  and  to  develop  uniform 
criteria  to  be  used  by  all  DOD  units  in  making  cost  estimates  (29: 

Enel  2). 

The  purpose  of  cost  estixnating  is  to  produce  reliable  cost 
estimates  for  use  in  decision  znaking  during  the  acquisition  process. 

Cost  estimates  are  used  by  DOD  decision-makers  to  weigh  the  poten- 
tial values  of  a developing  system  against  its  predicted  LCC.  The 

i 

^ same  estimates  are  also  used  for  (8:153-154): 

1 I.  Planning:  Cost  estimates  provide  information  to  support 

f 

' the  long-range  plans  for  proposed  weapon  systems  and  are 

, also  essential  considerations  in  prediction  of  the  LCC  com- 

ponents  of  a developing  system. 

I 2.  Budget  preparation:  "Within  the  DOD,  cost  estimates  are 

used  to  support  both  funding  requests  from  the  Congress  and 
funding  apportionment  among  the  services. 

^ 3.  Contract  pricing:  During  procurement,  government  cost 

j estimates  are  used  to  establish  price  negotiations  objectives 

1 and  to  provide  government  contract  negotiators  with  cost 

I estimates  independent  of  the  contractors'  estimates. 

4.  Estimating  the  cost  of  contract  changes:  Cost  estimates 
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»re  useful  in  providing  an  insight  to  project  managers  of  the 
impact  program  changes  will  have  on  the  system's  LCCs, 

5.  As  a measurement  tool  for  the  control  of  programs:  • 

Cost  estimates  provide  "yardsticks"  to  measure  the  predicted 
program  costs  versus  the  actual  program  costs. 

The  Air  Force  is  utilizing  cost  estimating  models  to  estimate 
both  weapon  system  development  costs  and  depot  level  overhaul  costs 
which  will  be  included  as  components  of  LCC  estiniatcs  of  developing 
weapon  systems.  Cost  estimates  are  made  by  developing  cost  estimat- 
ing relationships  (CER)  from  cost  data,  physical  characteristics,  and 
operating  parameters  of  existing  weapon  systems. 

CERs  are  mathematical  models  that  describe  the  cost  of  an 
item  or  of  an  activity  as  a function  of  one  or  more  independent  vari- 
ables. To  be  useful,  CERs  should  have  the  following  characteristics 
(31:Ch.3): 

1.  They  should  be  relatively  easy  to  use. 

2.  They  should  be  relatively  inexpensive  so  that  the  cost  of 
development  is  held  to  a minimum. 

3.  They  should  have  minimal  data  requirements  since  they 
are  usually  developed  and  tested  during  periods  when  little  is  known 
about  the  systems  for  which  the  CERs  are  being  developed. 

Although  CERs  are  used  to  predict  costs  of  new  or  develop- 
ing systems,  there  are  limitations  to  their  use.  For  example,  CERs 
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should  generally  not  be  applied  to  estimate  costs  of  radically  new 
systems  since  there  will  be  little  prior  knowledge  from  which  to  make 
meaningful  estimates.  Additionally,  since  CERs  are  developed  from 
minimal  data,  periodic  adjustments  may  be  required  to  allow  for 
changes  in  economic  trends,  design,  and  maintenance  policy  (31:Ch.  3). 
Since  each  cost  estimate  has  its  own  conceptual  ground  rules,  assump- 
tions, and  criteria,  each  is  prepared  for  a specific  purpose  at  a par- 
ticular time  within  a system's  acquisition  cycle.  For  these  reasons, 
the  development  of  an  all-purpose  CER  may  not  be  feasible  (22:3). 

Two  basic  types  of  cost  estimating  procedures  used  by  the 
Air  Force  are  (8:465): 

1.  the  traditional  industrial  engineering  approach  to  cost 
estimating  which  relies  on  detailed  amalysis  of  the  system 
being  developed,  and 

2.  the  parametric  cost  estimating  procedure  which  makes 
use  of  historical  cost  data  from  existing  systems  that  are 
considered  to  be  similar  in  design  and  level  of  technology 
to  the  developing  system. 

THE  INDUSTRIAL  ENGINEERING  METHOD 

The  industrial  engineering  approach  to  cost  estimating  relies 
on  the  availability  of  detailed  information  about  the  system  under 
development.  It  requires  the  use  of  all  the  costs  associated  with 
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those  elements  of  the  developing  system  which,  when  combined,  will 
form  the  final  product  (8:153-157).  It  has  the  advantage  of  being  able 
to  be  tailored  to  a specific  system  and  often  to  a specific  contractor. 

Its  major  strength  is  the  accuracy  of  the  estimate  it  provides. 

In  order  to  utilize  the  industrial  engineering  method,  certain 
facts  about  the  developing  system  must  be  known:  generally,  the  costs 
of  direct  and  indirect  labor  and  material  which  will  be  required  to  pro- 
duce the  system.  Because  of  the  level  of  detail  required,  this  method 
is  difficult  and  expensive  to  use  (8:153-157).  An  additional  drawback 
is  that  as  cost  estimates  for  various  components  of  the  weapon  system 
pass  through  succeeding  levels  of  management,  the  estimates  run 
. the  risk  of  becoming  inflated  through  failure  to  identify  the 
[subjective]  contributions  of  managers  at  each  level  of  summation 
[8:157].” 

THE  PARAMETRIC  METHOD 

The  parametric  method  of  cost  estimating  relies  upon  func- 
tional relationships  between  the  cost  of  the  alternative  and  the  various 
characteristics  or  parameters  of  the  alternative.  The  results  of  a 
parametric  estimate  depend  directly  upon  the  analyst's  ability  to 
identify  the  cost  influencing  characteristics  or  parameters  of  alterna- 
tives and  to  use  these  characteristics  or  parameters  to  develop  CERs, 
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A notable  strength  of  the  paraiinetric  method  is  that  this 
approach  can  provide  cost  estimates  during  the  conceptual  stage  of 
the  acquisition  process  before  detailed  engineering  plans  are  avail- 
able. Parametric  estimates  predict  cost  by  developing  mathematical 
relationships  among  the  variables,  e.  g. , cost  and  performance 
characteristics  and/or  physical  characteristics,  of  similar  systems 
(17:3). 

However,  in  utilizing  the  parametric  method  to  develop  cost 
estimates,  there  are  difficulties  which  may  be  encountered.  These 
include: 

1.  The  system  for  which  the  estimate  is  being  developed 
must  be  closely  related  in  performance  and  design  technology 
to  the  systems  used  to  provide  the  estimating  data  (28:9). 

2.  The  accuracy  of  the  estimate  developed  is  limited  by  the 
fact  that  physical  and  performance  characteristics  might  not 
explain  all  of  the  variance  in  the  predicted  cost  (28:9). 

3.  The  cost  estimate  and  the  CER  might  become  obsolete 
quickly  under  conditions  of  changing  technology  (8:156-157). 
Aside  from  the  major  difficulties  that  may  be  encountered 

when  utilizing  the  parametric  method  to  develop  CERs,  there  are 
advantages  to  the  use  of  this  approach: 

1.  As  previously  identified,  cost  estimation  is  possible 
early  in  the  conceptual  stage  prior  to  detailed  engineering 
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plans  being  available  (8:157). 


2.  The  use  of  past  experience  incorporates  engineering  and 
design  specification  changes  in  the  data  base  which  were  not 
identifiable  during  conceptual  development  of  the  system 
(28:10). 

3.  Cost  estimates  can  be  made  quickly  once  a CER  has  been 
developed  (28:10). 

RESEARCH  OBJECTIVE  AND  HYPOTHESIS 

Present  USAF  jet  engine  cost  estimating  relationships  for 
predicting  jet  engine  depot  level  overhaul  costs,  utilize  total  engine 
depot  level  overha'il  cost  as  the  dependent  variable.  However,  other 

than  the  engine  physical  characteristic  "engine  dry  weight"  the  inde- 

• • 

pendent  variables  used  by  the  AFLC  to  develop  CERs  published  in 
AFLC  Pamphlet  173-4  (Appendix  A- 1)  are  engine  operating  parameters 
(23). 

Total  engine  depot  level  overhaul  cost  includes  both  engine 
accessory  overhaul  cost  and  core-engine  overhaul  cost  (23:4-6). 
However,  since  the  decision  to  overhaul  an  engine  accessory  is  made 
independently  of  a decision  to  overhaul  a core-engine,  only  the  over- 
haul costs  for  jet  core-engines  were  considered  in  this  study. 
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Reaearch  Obiective 

The  objective  of  this  research  effort  was  to  develop  depot 
level  cost  estimating  relationships  for  USAF  jet  core-engines  utilizing 
both  engine  operating  parameters  and  physical  engine  characteristics. 

Research  Hypothesis 

It  was  hypothesized  that  cost  estimating  relationships  may  be 
developed  to  predict  USAF  core-engine  depot  level  overhaul  cost 
utilizing  both  engine  operating  parameters  and  physical  engine  charac- 
teristics. 


CHAPTER  II 


RESEARCH  METHODOLOGY 

The  research  methodology  employed  in  this  study  is  designed 
to  develop  an  acceptable  cost  estimating  relationship  (CER)  for  use  as 
a predictor  of  depot  level  overhaul  costs  for  military  jet  core-engines. 
The  "core-engine,"  i.  e. , the  basic  engine  without  engine  accessories, 
represents  the  primary  area  of  concern. 

DATA  COLLECTION 

• Variables  to  Be  Considered 

The  dependent  variable  (Y)  used  in  this  study  denotes  the 
depot  level  core-engine  overhaul  cost  for  military  jet  engines;  here- 
after referred  to  as  overhaul  cost.  ^ The  independent  variables  (Xj^'s) 
utilized  are  engine  performance  parameters  and  engine  physical 
characteristics  which  were  considered  potential  engine  overhaul  cost- 
drivers.  Each  of  the  independent  variables  is  identified  in  Appendix 
A-2. 

^The  dependent  variable  utilized  throughout  this  study  was 
expressed  as  a core-engine  overhaul  cost/flight  hour.  "COST" 
may  be  freely  substituted  throughout  this  study  in  place  of  the  term 
"overhaul  cost." 
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These  were  considered  potential  cost-drivers  based  on  a 
CER  development  study  (3),  an  interview  with  Air  Force  Aero  Pro- 
pulsion Laboratory  (AFAPL)  engineers  (16),  and  a pilot  study  con- 
ducted by  the  researchers  concerning  CERs  (4). 

The  Air  Force  CER  development  study  to  determine,  among 
other  objectives,  the  best  method  of  cost  estimating  jet  engine  produc- 
tion costs  concluded  that  the  use  of  jet  engine  physical  characteristics 
(e.g. , rotors,  stators,  blades,  etc.)  appear  to  be  the  best  independent 
variables  for  consideration  in  CERs  to  estimate  production  costs. 

The  problem  with  developing  cost  estimates  for  engine  production 
" . . . stems  [basically]  from  che  fact  that  all  engines  are  not  simply 
scale  ups  or  scale  downs  of  a single  basic  design  . . . [3:22]."  The 
more  imaginative  the  designers  become,  the  more  difference  there 
will  be  from  one  engine  to  the  next.  However,  even  though  engines 
may  vary  significantly  in  materials  and  technology,  at  the  component 
level,  all  jet  turbine  engines  have  rotors,  blades,  and  stators;  other- 
wise they  would  be  something  else,  e.g.,  ramjets  or  turborockets 
(3:21-24). 

As  previously  discussed  in  Chapter  I,  present  CER  models 
for  predicting  USAF  jet  engine  overhaul  cost  utilize  exclusively, 
except  for  the  one  physical  characteristic,  engine  dry  weight,  inde- 
pendent variables  which  are  engine  performance  parameters.  Based 
upon  a review  of  the  USAF/Air  Force  Logistics  Command's  (AFLC) 


CERs  for  jet  engine  overhaul,  the  researchers  felt  that  engine  physi- 
cal characteristics  should  also  be  included  as  independent  variables 
(23).  All  jet  turbine  engines  have  certain  basic  physical  character- 
istics in  coxruncn  which  include: 

1.  A compressor  assembly  comprised  of  rotors,  stators, 
and  blades. 

2.  A turbine  assembly  comprised  of  rotors,  stators,  and 
blades. 

3.  A particvilar  design  of  the  combustion  section,  i.  e. , 
annular,  semi-annular,  can-annular. 

Data  Sample 

Cost  Data.  Cost  data  for  this  study  were  provided  by  the  AFLC's 
ICAD.  The  dependent  variable  referred  to  as  COST  represents  the 
average  jet  core-engine  depot  level  overhaul  cost  per  flight  hour. 
These  data  were  accumulated  for  the  engines  listed  in  Appendix  A- 3 
and  represent  fiscal  years  1974,  1975  and  1976.  Cost  data  were  con- 
verted to  equivalent  1976  dollars  using  price  indices  for  engine  costs 
from  Air  Force  Regulation  (AFR)  173-10. 

Engine  Sample.  The  engine  sample  consisted  of  the  USAF  type/model/ 
series  jet  engines  contained  in  Appendix  A-3.  As  discussed  in  the 
Research  Objective  and  Hypothesis  section  of  Chapter  I,  present  USAF 

jet  engine  CERs  published  by  the  AFLC  considered  primarily  engine 
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operating  parameters  as  the  independent  variables.  Since  the  research 
hypothesis  of  this  study  was  that  CERs  which  considered  both  engine 
operating  parameters  and  engine  physical  characteristics  would  pro- 
duce better  cost  models,  the  engine  sample  utilized  by  the  AFLC's 
ICAD  to  develop  models  published  in  AFLC  Pamphlet  173-4  was 
adopted  for  comparative  purposes. 

Summary  depot  level  core-engine  overhaul  COST  was 
obtained  from  the  AFLC  ICAD  by  type/model/series  engine  and  was 
used  in  a pilot  study  conducted  by  the  researchers  (4).  The  operating 
parameters  input  as  independent  variables  in  the  research  pilot  study 
were  incorporated  in  this  study  and  the  number  of  physical  character- 
istics used  as  independent  variables  for  each  engine  was  increased. 

The  increase  resulted  from  discussions  with  AFAPL  personnel  (1). 
These  additional  characteristics  were  obtained  from  the  illustrated 
parts  breakdown  manual  and  overhaul  manuals  for  each  specific  engine. 
Physical  engine  characteristics  which  could  not  be  located  in  either  of 
the  above  sources  were  obtained  from  the  Air  Force  Aero  Propulsion 
Laboratory  (AFAPL). 

Within  the  engine  sample,  the  data  were  grouped  so  that  80 
percent  of  the  engine  data  within  each  category  was  utilized  to  develop 
CER  models  and  the  remaining  20  percent  of  the  engine  data  was  with- 
held and  used  strictly  for  testing  the  predictive  capability^  of  the 

^A  discussion  of  predictive  capability  will  be  presented  later 
in  the  discussion. 


16 


developed  CER  models  (13:38). 


MODEL  JUSTIFICATION 

The  objective  was  to  develop  CERs  which  were  as  uncompli- 
cated as  possible  so  that  a cost  estimate  could  be  derived  with  mini- 
mum effort  and  time  and  at  a minimum  cost  to  the  organization. 
Results  of  a pilot  study  conducted  indicated  that  combinations  of  engine 
operating  parameters  and  engine  physical  characteristics  could  be 
used  to  develop  such  CERs  and  that  the  methodology  proposed  herein 
would  be  most  appropriate  (4). 

MODEL  DEVELOPMENT 

The  CER  models  developed  were  relationships  between  one 
or  more  jet  engine  operating  parameters  and/or  jet  engine  physical 
characteristics  and  overhavil  costs.  To  find  an  acceptable  CER, 
various  CER  models  were  developed  using  regression  analysis  tech- 
niques. 
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data.  (33:Ch.  13).  The  pilot  study  performed  by  the  researchers 
supports  this  assumption.  (4).  The  assumption  that  historical  data  is 
indicative  of  and  can  be  utilized  to  predict  future  occurrences  is 
critical  when  using  regression  analysis  as  a technique  for  developing 
a predictive  model. 

Begression  analysis  is  applicable  to  any  finite  number  of 
independent  variables  (Xj^'s)  if  those  variables  can  be  assumed  to 
have  the  qualities  necessary  to  apply  regression  analysis.  Those 
qualities  are  listed  as  assumptions  of  regression  analysis  in 
Appendix  B. 

Simple  linear  regression  (SLR)  involves  the  regression  of 
a dependent  variable  (Y)  on  a single  independent  variable  (X). 

Multiple  linear  regression  (MLR)  is  an  extension  of  SLR  where  Y is 
regressed  on  two  or  more  X^'s.  The  forward  stepwise  inclusion  MLR 
of  the  Statistical  Package  for  the  Social  Sciences  (SPSS)  was  used  for 
CER  model  development  (15:345-347).  The  SPSS  forward  inclusion 
MLR  allows  the  user  to  specify  the  values  of  the  three  criteria  just 
mentioned  or  to  allow  the  package's  built-in  default  criteria  of 
F = .01,  tolerance  = .001,  and  maximum  number  of  variables  = 80 
to  be  operative.  In  this  study,  the  SPSS  default  criteria  were  employed. 
All  variables  not  considered  for  inclusion  in  the  regression  model  by 
the  SPSS  forward  inclusion  method  were  considered  qualitatively  by 
the  researchers  with  respect  to  size  of  the  sample  and  to  the  possible 
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effects  of  multicollinearity.  ^ For  a further  discussion  of  regression 
analysis  refer  to  Appendix  B. 

Forward  inclusion  is  a method  used  in  MLR  for  developing 
a regression  equation.  The  independent  variables  included  in  the 
equation  are  included  in  the  order  of  the  highest  contribution  to  explain- 
ing the  variation  of  the  regression  model.  The  process  continues  until 
one  or  more  of  the  following  criteria  are  no  longer  met: 

1.  none  of  the  F values  (SPSS  default  criterion)  of  the 
remaining  variables  exceeds  the  minimum  acceptable  F 
value, 

2.  the  tolerance  criterion  is  not  met, 

3.  the  limit  on  the  maximum  number  of  variables  to  be 
included  is  exceeded. 

Correlation  Analysis 

The  dependent  variable  (Y)  was  regressed  on  various  combi- 
nations of  the  independent  variables  (X^'s).  The  determination  of 
which  independent  variables  were  considered  for  inclusion  in  the 
regression  model  was  based  on  an  analysis  of  the  correlation  coef- 
ficient (r)  between  each  possible  pair  of  independent  variables.  Only 


^Multicollinearity  is  the  situation  in  which  there  is  a very 
high  degree  of  correlation  (approximately  0.90  or  higher)  between 
independent  variables.  As  the  correlation  between  independent  vari- 
ables approaches  unity  (representing  a perfect  linear  relationship), 
the  regression  coefficients  become  less  valuable  for  predictive  pur- 
poses (5:616). 
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one  of  any  pair  of  independent  variables  was  considered  if  the  r value 
was  greater  than  .80  (14:407).  The  inclusion  of  succeeding  independ- 
ent variables  into  the  CEHs  being  developed  was  influenced  by  analysis 
of  the  values  of  first  order  partial  correlations. 

Correlation  analysis  is  designed  to  determine  the  extent  of 
the  linear  relationship  between  variables.  Correlation  analysis  can 
point  out  spurious^  as  well  as  valid  useful  relationships,  however,  a 
correlation  from  chance  causes  may  be  found  between  variables  which 
may  have  no  logical  relationship  to  each  other.  The  coefficient  of 
correlation  (r)  will  take  on  a value  between  -1  and  +1.  A value  of  +1 
indicates  a perfect  linear  positive  or  proportional  relationship  between 
two  variables.  A value  of  -1  also  indicates  a perfect  linear  relation- 
ship between  two  variables;  however,  the  relationship  is  negative  or 
inverse.  A correlation  value  of  zero  indicates  no  linear  relationship 
between  the  variables  correlated.  It  must  be  noted,  however,  that 
correlation  does  not  necessarily  imply  that  a cause -and -effect  rela- 
tionship exists  (5:542-558;  15:276-280;  10;  34). 

Correlation  between  any  two  variables  is  called  bivariate, 
simple  correlation,  or  zero^^  order  partial  correlation.  The  coef- 
ficient of  correlation  of  a bivariate  correlation  is  represented  herein 
as  r^. 


^A  spurious  relationship  between  two  variables  is  indicated 
when  the  value  of  their  first  order  partial  correlation  coefficient  is 
some  measure  lower  than  the  value  of  their  bivariate  correlation 
coefficient  (9:317). 
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First  order  partial  correlation  measures  the  relative  impor- 
tance of  each  of  the  independent  variables  separately  in  explaining 
variations  in  the  dependent  variable  while  at  the  same  time  controlling 
for  the  variations  in  another  independent  variable.  Computations  of 
first  order  partial  correlation  (represented  herein  as  r^)  may  be  per- 
formed to  measure  the  extent  to  which  the  variation  in  the  dependent 
variable,  which  was  not  explained  by  all  the  other  independent  vari- 
ables, is  explained  by  the  addition  of  the  new  independent  variable 
(9:317;  34). 

Correlation  Evaluation  Rule  1.  Bivariate  and  first  order  partial  cor- 
relation analysis  was  performed  on  each  of  the  engine  categories 
listed  in  Appendix  A-4.  To  aid  the  zinalysis  of  bivariate  correlation 
coefficients,  a bivariate  coefficient  of  correlation  value  of  r^  > ^.7000 
was  arbitrarily  established  as  an  initial  criterion  to  select  those  inde- 
pendent variables  which  appeared  to  have  the  strongest  linear  corre- 
lation with  COST.  This  criterion  is  herein  referred  to  as  Rule  1. 

Correlation  Evaluation  Rule  2.  A consideration  in  choosing  variables 
for  developing  a CFR  is  to  exclude  one  of  a pair  of  independent  vari- 
ables which  exhibits  a high  r^  between  them.  This  action  is  taken  to 
avoid  the  problems  of  multicollinearity.  It  was  arbitrarily  deter- 
mined that  if  a bivariate  correlation  coefficient  between  any  two 
independent  variables  was  equal  to  or  greater  than  .8000,  the  use 
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of  these  two  variables  together  in  an  MLK  model  would  introduce  the 
effects  of  multicollinearity.  To  aid  in  the  selection  of  variables  to  be 
used  in  ein  MLR,  the  researchers  defined  the  coefficient  of  correla- 
tion ranges  between  paired  independent  variables  as  follows: 

Range  of  Tq  Meaning  Action 

0 to  i .50  Little  or  no  linear  corre-  Regress  both  variables 

^ lation  between  the  paired  with  COST, 

variables. 

+ . 50  to  ^.79  Moderate  linear  correla-  Regress  one  of  the 

tion  between  the  variables.  variables  with  COST. 

If  the  number  of  vari- 
ables available  for 
regression  is  small, 
an  arbitrary  judgment 
as  to  the  inclusion  of 
the  second  variable  is 
appropriate. 

+ . 80  to  i 1.00  High  linear  correlation  Regress  only  one  of  the 

between  the  variables.  pair  with  COST. 

This  bivariate  coefficient  of  correlation  evaluation  criterion  is  referred 
to  as  Rule  2 hereaifter. 


Correlation  Evaluation  Rule  3.  To  aid  the  analysis  of  first  order  cor- 
relations, searching  for  spurious  bivariate  correlations,  the 
researchers  defined  the  following  heuristic  rule  hereafter  referred  to 
as  Rule  3: 

1.  If  the  absolute  value  of  r^  is  less  than  the  absolute  value 
of  r^  minus  .2000,  the  bivariate  correlation  value  would  be 
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considered  to  represent  a spurious  relationship  due  to  the 


effects  of  the  rj^  control  variable. 

2.  If  the  algebraic  sign  of  the  r^  value  was  opposite  the  sign 
of  the  rj^  value,  the  r^  value  would  be  considered  to  repre- 
sent a spurious  relationship  due  to  the  effects  of  the  r^^  con- 
trol variable. 

The  application  of  heuristic  Rules  1,  2,  and  3 aided  the 
researchers  in  the  analysis  of  bivariate  correlations  between  the 
dependent  variable  COST  and  the  independent  variables  listed  in 
Appendix  A-2,  the  analysis  of  bivariate  correlation  between  pairs  of 
independent  variables,  and  the  first  order  correlation  of  COST  with 
each  independent  variable. 

Model  Categorization 

The  CERs  which  have  been  developed  by  the  Independent  Cost 
Analysis  Division  (ICAD)  of  the  AFLC  categorized  jet  engines  as 
follows  (23); 

1.  TURBOJETS 

2.  TURBO  FANS 

However,  as  a result  of  the  pilot  study  conducted,  it  was 
concluded  that  there  may  be  other  categorizations  of  jet  engines  which 
might  produce  statistically  significant  models  and  provide  a more 
accurate  estimate  of  jet  engine  overhaul  cost.  The  additional  cate- 


gories identified  were: 
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1.  ALL  ENGINES 

2.  TURBOJETS  (with  afterburner) 

3.  TURBOJETS  (without  afterburner) 

4.  TURBOFANS  (with  afterburner) 

5.  TURBOFANS  (without  afterburner) 

MODEL  VALIDATION 

Regression  models  constructed  were  subjected  to  standard 
statistical  tests  to  determine  the  statistical  significance  and  accepta- 
bility of  the  model.  F-tests  were  performed  on  the  overall  regression 
model  and  on  the  net  contribution  of  the  individual  independent  vari- 
ables (Xj^'s)  to  the  explained  variation.  The  Xj^'s  were  tested  simul- 
taneously using  the  Bonferroni  technique  to  compute  equivalent-alpha 
(a)  values  (14:257).  The  coefficient  of  determination  (R^)  and  the 
standard  error  of  the  estimate  were  also  considered  for  model  accept- 
ability. Confidence  intervals  were  constructed  for  each  model  at  the 
95  percent  confidence  level  to  test  the  predictive  capability  of  the 
model  and  as  an  indicator  of  its  utility.  If  the  overall  model  was 
found  to  be  significant  at  the  95  percent  confidence  level,  and  the  coef- 
ficient of  determination  (R^)  was  greater  than  . 8000, an  arbitrary 
choice  (14:92),  the  model  was  considered  acceptable.  Acceptance 
criteria  for  the  standard  error  of  the  estimate  and  the  accuracy  and 
reliability  of  the  model's  predictive  ability  were  not  established  since 
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neither  the  ICAD  nor  the  AFAPL  have  established  a minimum  accept- 
ance level  for  these  criteria  (1;  18;  32). 
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Predictive  Capability 

Cost  model  predictability  was  examined  for  each  CER 
developed.  Once  a model  had  been  developed  using  80  percent  of  the 
sample  data  and  met  the  and  confidence  level  criteria  previously 
discussed,  an  average  percent  predictability  was  computed.  Using  a 
test  engine,  the  average  percent  predictability  was  computed  by  divid- 
ing the  absolute  numerical  difference  between  the  test  engine's  actual 
COST  and  predicted  COST  by  the  actual  COST: 

actual  COST  - predicted  COST  ^ i nn 
actual  COST  ^ 

The  maiximum  acceptable  predictability  level  was  established  by  the 
researchers  as  50.0  percent. 

SUPPORT  OF  RESEARCH  HYPOTHESIS 

If  the  CER  model(s)  met  the  criteria  for  statistical  signifi- 
cance and  acceptability  presented  in  the  Model  Validation  section,  the 
research  hypothesis  was  considered  supported  and  the  CERs  developed 
were  considered  acceptable. 


/ 
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SUMMARY  LIST  OF  ASSUMPTIONS 

1.  The  independent  variables  chosen  are  cost-drivers  with 
respect  to  overhaul  cost  and  that  they  are  logically  supportable. 

2.  The  relation  between  the  dependent  variable  and  independ- 
ent variables  is  linear. 

3.  Regression  analysis  is  an  appropriate  modeling  technique. 

4.  Historical  data  is  indicative  of  and  can  be  utilized  to  pre- 
dict future  occurrences. 

5.  The  independent  variables  satisfy  the  assumptions  of 
regression  analysis. 

SUMMARY  LIST  OF  LIMITATIONS 

1.  CFRs  generally  cannot  be  used  to  estimate  costs  of 
radically  different  new  systems  since  CERs  are  developed  from  his- 
torical experience  (11:7). 

2.  The  accuracy  of  the  cost  data  is  limited  by  the  accuracy 
of  the  reporting  system. 

3.  The  researchers  were  uncertain  as  to  whether  the  inde- 
pendent variables  identified  and  used  to  develop  jet  core-engine  over- 
haul CERs  represented  a complete  list  of  all  possible  cost  drivers. 

4.  Complete  data  records  were  not  available  for  all  engines. 

5.  The  independent  variables  chosen  were  not  totally  applic- 
able to  each  engine  category  identified  for  investigation. 
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CHAPTER  UI 


DATA  ANALYSIS  AND  FINDINGS 


INTRODUCTION 


Chapter  III  presents  the  analysis  and  findings  of  the  research. 


which  was  conducted  to  develop  parametric  cost  estimating  relation- 


ships^ (CERs),  for  predicting  core-engine  depot  level  overhaul  costs. 


for  USAF  jet  engines.  Initial  discussion  will  focus  on  the  independent 


variables  utilized  in  the  study  and  the  analysis  and  findings  of  a bivar- 


iate and  first  order  partial  correlation  analysis  which  was  performed. 


Following  the  presentation  of  the  findings  of  the  correlation 


analysis,  a two-part  discussion  of  the  auialysis  and  findings  of  the 


CERs  which  were  developed  is  provided.  The  first  part  of  the  dis- 


cussion, pertaining  to  CER  development  will  focus  on  the  models 


developed,  an  analysis  of  the  coefficient  of  determination  (R^),  confi- 


dence level,  and  percent  predictability  obtained  for  each  engine  cate- 


gory model.  The  second  part  will  cover  an  analysis  of  the  distribu- 


tion of  R^  values  for  models  developed  in  each  engine  category  and 


an  analysis  of  cost  models  developed  by  use  of  a modified  multiple 


^Throughout  this  chapter,  the  terms  "cost  model"  or  "model" 
should  be  interpreted  to  mean  cost  estimating  relationship. 
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linear  regression  (MLR)^  program.  The  emphasis  of  this  discussion 
is  on  the  expected  value,  the  average  percent  predictability,  and 
average  percent  efficiency  computed  for  cost  models  derived  for  each 
of  the  engine  categories:  ALL  ENGINES,  TURBOJETS,  and 
TURBOFANS. 

INDEPENDENT  VARIABLE  LIMITATIONS 


The  independent  variables  identified  for  use  in  this  study  are 
identified  in  Appendix  A-2.  The  assumption  made  was  that  the  inde- 
pendent variables  chosen  were  cost-drivers  of  core- engine  overhaul 
cost  and  that  their  utilization  in  a CER  to  predict  this  cost  could  be 
logically  supported.  However,  limitations  regarding  the  independent 
variables  were  uncovered  once  the  data  collection  effort  was  complete. 
First,  it  was  noted  that  complete  data  records  were  not  available  for 
all  engines.  The  two  independent  variables  that  contributed  to  the 
lack  of  complete  data  records  were  variable  X14  (NORMTIT)  and 
variable  X15  (MILTIT).  Complete  data  elements  for  these  variables 
were  not  obtained  because,  depending  on  the  specific  engine,  either 
NORMTIT  or  MILTIT  ratings  were  not  published.  Because  of  the 
inability  of  the  researchers  to  collect  data  on  these  two  independent 
variables  for  all  engines,  these  variables  were  not  considered  in  the 
development  of  any  CER. 
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The  second  limitation  noted,  concerning  the  independent  vari- 


ables, related  to  whether  a specific  independent  variable  should  be 
considered  in  the  development  of  a specific  CER.  It  was  found  that 
the  independent  variable  X13  (MAXT)  had  been  obtained  only  for  those 
engines  which  are  equipped  with  afterburners.  Therefore,  variable 
X13  was  considered  only  in  model  categories  (1)  TURBOFANS  (with 
afterburner)  and  (2)  TURBOJETS  (with  afterburner).  Variable  X19 
(MAXSFC)  followed  the  same  pattern  as  variable  X13,  i.  e.,  a data 
element  was  obtained  for  variable  X19  for  only  those  engines 
equipped  with  an  afterburner.  Consequently,  variable  X19  was  only 
considered  for  models  developed  for  TURBOFANS  (with  afterburner) 
and  TURBOJETS  (with  afterburner).  Variable  X22  (FBR)  is  unique  to 
turbofan  engines.  Consequently,  it  was  considered  in  CERs  for 
TURBOFANS,  TURBOFANS  (with  afterburner),  and  TURBOFANS 
(without  afterburner). 

Appendix  C-1  lists  the  independent  variables  considered  in 
CER  development  by  engine  category.  Prior  to  the  initial  develop- 
ment of  CERs  using  the  MLR  technique,  the  independent  variables  to 
be  considered  in  each  engine  category  were  subjected  to  bivariate  and 
first  order  partial  correlation  analysis. 

CORRELATION  ANALYSIS 

Bivariate  correlation  (r^)  analysis  between  the  dependent 

variable  COST  and  the  independent  variables,  and  between  each  pair 
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of  independent  variables  was  performed  by  engine  category.  Addi- 
tionally, first  order  partial  correlation  (r^)  analysis  was  performed 
between  the  dependent  variable  COST  and  each  independent  variable 
that  met  the  Rule  1 criterion^  defined  in  Chapter  n.  However,  due  to 
the  small  number  of  engines  included  in  four  of  the  seven  engine  cate- 
gories as  shown  in  Appendix  A-4,  correlation  analysis  was  limited  to 
the  categories  ALL  ENGINES,  TURBOJETS,  and  TURBOFANS. 

ALL  ENGINES 

The  engines  included  in  the  correlation  analysis  for  the 
category  ALL  ENGINES  are  listed  in  Appendix  C-2.  1.  Of  the  22 
independent  variables  considered  (Appendix  C-1)  in  this  category  as 
possible  core-engine  cost  drivers,  no  independent  variable  displayed 
a bivariate  coefficient  of  correlation  with  the  dependent  variable 
COST  equal  to  or  greater  than  + .7000.  Therefore,  the  minimum 
level,  Tq,  for  Rule  1,  was  arbitrarily  lowered  first  to  . 6000  and 
then  to  ^ . 5000.  At  the  ^ .6000  level,  two  of  the  22  independent 
variables,  X6  (L)  and  X25  (AFHBO),  displayed  r^  values  that  met  or 
exceeded  the  minimum  requirement.  At  the  5000  level,  three 

additional  variables,  X8  (LD),  Xl6  (MAXTIT),  and  X23  (EPR),were 

^Extensive  reference  is  made  throughout  the  Correlation 
Analysis  section  of  this  chapter  to  "Rules  1,  2,  and  3."  These 
"Rules"  were  introduced  in  the  Correlation  Analysis  section  of 
Chapter  11  to  assist  the  researchers  in  evaluating  the  results  of 
bivariate  and  first  order  partial  correlation  analysis. 
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identified  as  possibly  having  a meaningful  linear  relationship  u^ith 
COST.  Appendix  C-3  lists  the  five  independent  variables  that  dis- 
played a bivariate  coefficient  of  correlation  with  COST  equal  to  or 
greater  than  + . 5000. 

Of  the  five  variables  which  met  the  modified  Rule  1 minimum 
of  . 5000,  variable  X6  exhibited  the  highest  direct  correlation  with 
COST  (.6721)  and  the  variable  X25  displayed  an  inverse  r^  relation- 
ship with  COST  of  -.6278.  The  remaining  three  variables  (X8,  X16, 
and  X23)  displayed  r^  values  of  .5235,  .5484,  and  .5847  respectively. 
Appendix  C-4  provides  a tabular  summary  of  the  bivariate  coef- 
ficients of  correlation  between  the  five  variables  mentioned  above 
which  met  the  modified  Rule  1 r^  criterion. 

Rule  2 was  used  to  classify  the  strength  of  the  linear  relation- 
ship between  independent  variables.  Of  the  10  pairs  of  independent 
variables  analyzed,  one  pair,  X16  (I^IAXTIT)  with  X23  (EPR),  was 
classified  as  having  a high  linear  relationship.  Five  pairs  of  inde- 
pendent variables,  X8  (liO)  with  X6  (L),  X6  with  X16,  X6  withX23, 

X6  with  X25  (AFHBO),  and  X16  with  X25,  were  classified  as  having 
a moderate  linear  relationship,  and  four  pairs,  X8  with  X-16,  X8 
with  X23,  X8  with  X25,  and  X23  with  X25,  were  classified  as  having 
no  meaningful  linear  relationship  between  the  paired  variables  (refer 
to  Appendix  C-4). 
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A first  order  partial  correlation  analysis  was  performed 
with  each  bivariate  combination  of  the  dependent  variable,  COST,  and 
the  five  independent  variables  which  met  the  modified  Rule  1 criter- 
ion. Appendix  C-5  presents  the  results  of  that  r^^  correlation.  The 
researchers  were  interested  in  determining  those  r^  correlations 
with  the  dependent  variable  which  could  be  shown  by  first  order 
partial  correlation  analysis  to  be  spurious  relationships.  To  accom- 
plish this  analysis,  the  evaluation  criteria  of  Rule  3,  as  defined  in 
Chapter  n,  was  applied. 

X6  (L) 

The  first  order  partial  correlation  of  X6  with  COST,  con- 
trolling for  each  other  independent  variable,  is  shown  in  Appendix 
C-5.  The  Tq  value  of  X6  with  COST  was  computed  to  be  .6721. 
Appendix  C-5  indicates  that  this  bivariate  correlation  is  upheid  by 
first  order  partial  correlation  when  controlling  for  all  other  vari- 
ables except  X23  (EPR)  and  X25  (AFHBO).  Both  X23  and  X25  are 
among  those  independent  variables  initially  selected  as  having  the 
best  linear  correlation  with  COST  (rQ>  _+  . 5000). 

X8  fLD) 

The  bivariate  correlation  coefficient  of  X8  and  COST  (.  5235) 
was  found  to  be  supported  by  all  control  variables  except  X6  (L)  and 
X25  (AFHBO).  The  variables  X6  and  X25  were  among  those  variables 
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which  met  che  modified  Rule  1 criterion 


X16  (MAXTIT) 

The  first  order  partial  correlation  of  X16  with  COST,  con- 
trolling for  the  effects  of  all  other  independent  variables,  indicated 
that  the  r^  value  of  . 5484  was  supported  by  all  variables  except 
X6  (L),  X23  (EPR),  and  X25  (AFHBO).  Variables  X6,  X23,  and  X25 
are  among  those  independent  variables  that  were  considered  to  have 
the  best  linear  correlation  with  COST. 

X23  (EPR) 

Analysis  of  the  first  order  partial  correlation  of  X23  with 
COST,  while  controlling  for  all  other  variables,  depicts  the  same 
pattern  evidenced  by  the  rj^  vadues  of  X6  (L),  X8  (LD),  and  Xl6 
(MAXTIT),  That  is,  the  first  order  partial  correlation  coefficient  of 
X23  with  COST  was  supported  by  all  control  variables  except  those 
control  variables  which  met  the  modified  Rule  1 criterion. 

X25  (AFHBO) 

The  pattern  evidenced  throughout  the  analysis  of  first  order 
partial  correlations  of  Rule  1 variables  has  indicated  that  variables 
that  displayed  the  best  bivariate  correlation  with  COST  do  not  sup- 
port bivariate  correlations  of  other  variables  with  COST  when  used 
as  control  variables.  The  first  order  partial  value  of  X25  (-.6278) 
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is  upheld  when  controlling  for  all  other  variables  except  X6  (L)  and 
X16  (MAXTIT). 
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Summary  of  ALL  ENGINES  Correlation  Analysis 

The  results  of  the  first  order  partial  correlation  analysis  of 
Rule  1 variables  suggests  that  the  bivariate  correlation  of  each  inde- 
pendent variable  with  COST  is  a valid  correlatiou.  Application  of 
Rule  2 in  selecting  variables  to  regress  in  an  MLR  suggested  which 
combinations  of  independent  variables  should  not  be  regressed 
together  if  the  effects  of  mvilticoUinearity  on  the  regression  equation 
are  to  be  avoided.  Appendix  C-6  presents  representative  combina- 
tions of  Rule  1 independent  variables  that  may  produce  the  simplest 
CER  while  avoiding  the  affects  of  multicoilinearity  between  the  inde- 
pendent variables. 

TURBOJETS 

Engines  included  in  the  correlation  analysis  performed  on  the 
engine  category  TURBOJETS  are  listed  in  Appendix  C-2.  2.  Of  the  22 
independent  variables  considered  (Appendix  C-l),it  was  found  that, 
similar  to  the  bivariate  correlation  analysis  performed  on  the  cate- 
gory ALL  ENGINES,  no  independent  variable  displayed  a bivariate 
coefficient  of  correlation  with  COST  greater  than  the  Rule  1 minimum 
of  ^ . 7000.  The  researchers  therefore  chose  to  arbitrarily  lower  the 

Rule  1 criterion  for  r„  from  r.  > + . 7000  to  r > + . 5000.  By 
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lowering  the  value,  seven  of  the  22  variables  correlated  with  COST 
displayed  a coefficient  of  correlation  whose  value  exceeded  the  lower 
r^  limit.  These  variables  are  listed  in  .Appendix  C-7.  Appendix  C-8 
provides  a summary  of  the  bivariate  coefficients  of  correlation 
between  each  pair  of  Rule  1 variables. 

Rule  2 was  applied  to  classify  the  degree  of  each  linear 
relationship.  Of  the  21  pairs  of  independent  variables  correlated,  12 
pairs  were  classified  as  displaying  a high  linear  correlation:  Xl  (CR) 
with  X2  (CB),  XI  with  X3  (TR),  XI  with  X8  (LD),  XI  with  X23  (EPR), 
X2  with  X3,  X2  with  X23,  X3  with  X6  (L),  X3  with  X8,  X3  with  X23, 

X6  with  X8,  X6  with  X23,  and  X8  with  X23. 

First  order  partial  correlation  was  performed  on  each  bivar- 
iate combination  of  COST  with  the  seven  variables  which  met  the 
modified  Rule  1 criterion.  Appendix  C-9  displays  the  result  of  that 
correlation.  Again,  the  objective  was  to  determine  whether  the  r^ 
correlations  depicted  in  Appendix  C-7  were  indicative  of  true  linear  " 
correlations  or  whether  they  were  the  results  of  spurious  relation- 
ships caused  by  the  action  of  the  control  variable.  Rule  2 was  applied 
to  aid  the  researchers  in  this  analysis. 

XI  (CR),  X2  (CB),  X3  (TR),  X6  (L),  X8  (LD),  and 
X23  (EPR) 

First  order  partial  correlations  analysis  of  the  listed  vari- 
ables suggested  that  the  r^  value  for  each  of  these  variables  with 
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COST  may  be  spurious.  In  each  case,  the  control  variables,  •which 


did  not  support  the  r^  correlation  for  the  listed  variables  with  COST, 
were  primarily  those  variables  whose  bivariate  correlation  with 
COST  was  equal  to  or  greater  than  ^ . 5000. 

X25  (AFHBO) 

The  first  order  partial  analysis  of  variable  X25  (AFHBO) 

suggests  that  the  r value  is  a true  indication  of  its  correlation  with 
o 

COST.  Only  two  variables  did  not  uphold  the  r^  value  of  X25.  They 
were  variables  X6  (L)  and  X8  (LD);  both  of  which  displayed  r^  values 
equal  to  or  greater  than  ^ . 5000. 

Summary  of  TURBOJET  Correlation  Analysis 

The  results  of  the  bivariate  correlation  analysis  of  the  inde- 
pendent variables  with  COST  found  that  none  of  the  independent  vari- 
ables displayed  an  rg  value  equal  to  or  greater  than  ^ . 7000.  This 
fact  caused  the  researchers  to  arbitrarily  lower  the  Rule  1 minimum" 
r^  value  to  ^ . 5000.  The  lowering  of  the  r^  value  allowed  seven  vari- 
ables to  meet  the  modified  Rule  1 criterion  for  further  first  order 
partial  correlation  analysis.  Of  the  seven  variables  analyzed,  first 
order  partial  correlation  analysis  suggested  that  only  one  variable, 
X25  (AFHBO),  exhibited  a nonspurious  bivariate  coefficient  of  corre- 
lation with  COST.  In  each  case  the  control  variables  which  did  not 
uphold  the  r^  correlation  were  those  variables  which  also  met  the 
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modified  Rule  1 criterion.  The  one  exception  was  variable  X9  (W). 

When  X9  functioned  as  the  control  variable  for  variables  XI  (CR)  and 
X3  (TR),  it  did  not  support  either  the  XI  or  the  X3  r^  values  with 
COST. 

TURBOFANS 

The  engines  included  in  the  correlation  analysis  for  the  cate- 
gory TURBOFANS  are  listed  in  Appendix  C-2.  3.  Of  the  23  independ- 
ent variables  considered  (Appendix  C-1),  14  variables  were  found  to 

display  a bivariate  coefficient  of  correlation  with  COST  above  ^ . 7000.  | 

The  independent  variables  which  met  the  Rule  1 criterion  and  which 
would  be  subjected  to  further  correlation  analysis  are  listed  in 
Appendix  C-10. 

A bivariate  correlation  was  performed  between  each  combi- 
nation of  independent  variables  whose  bivariate  correlation  with  COST 
met  the  Rule  1 criterion  (refer  to  Appendix  C-11).  Rule  2 was  applied 
to  assist  in  the  analysis.  Of  the  123  pairs  of  variables  examined,  116 
pairs  were  classified  as  having  high  linear  relationships,  7 pairs 
were  classified  as  having  moderate  linear  relationships,  and  none  of 
the  variable  pairs  examined  had  an  r^  less  than  ^ .6000.  Addi- 
tionally, one  pair  of  the  independent  variables  examined,  X6  (L)  with 
X8  (LD),  exhibited  a perfect  linear  relationship  (r^  = 1.  0000). 

Appendix  C-12  displays  the  results  of  the  first  order  partial  correla- 
tion performed  between  COST  and  each  Rule  1 variable. 
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Rule  3 was  applied  to  assist  the  researchers  in  determining 

which  r correlations  with  COST  were  true  indications  of  linear  rela- 
o 

tionships  and  not  spurious  relationships  influenced  by  the  action  of  the 
control  variable.  The  first  order  partial  correlation  analysis  indi- 
cated that  the  bivariate  correlation  of  variable  X4  (TB)  with  COST 
was  the  only  r^  generally  supported.  Two  control  variables,  how- 

1 

ever,  did  not  support  X4's  rg  with  COST:  XIO  (TW)  and  X23  (EPR).  ; 

Like  the  results  of  the  correlation  analysis  performed  on  other  engine 

categories,  the  variables  which  did  not  support  the  value  of  X4 

were  also  Rule  1 variables.  All  other  independent  variables, whose  r^ 

with  COST  was  tested  by  the  application  of  Rule  3, were  considered  to 

display  spurious  relationships  with  COST. 

Summary  of  TURBOFANS  Correlation  Analysis 

The  TURBOFANS  category  had  10  variables  which  met  the 
Rule  1 criterion  and  Il6  of  123  pairs  of  Rule  1 variables  chat  were 
considered  to  have  high  linear  relationships.  However,  although  the 
variables  of  this  category  displayed  strong  results  after  the  a _ ca- 
tion of  Rules  1 and  2,  only  one  variable,  X4  (TB),  withstood  a first 
order  partial  correlation  challenge  of  its  bivariate  correlation  value 


1 

with  COST. 

Findings  of  Bivariate  and  First  Order  Correlation  Analysis  j 
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TURBOFANS  were  performed  to  assess  the  presence  of  the  linear 
relationships  between  the  category  variables.  The  Rule  1 criterion 
defined  in  Chapter  II  established  a measure  to  classify  or  select 
those  variables  which  would  be  considered  to  have  the  most  meaning- 
ful linear  relationship  with  COST.  Of  the  three  engine  categories, 
only  the  category  TURBOFANS  produced  any  bivariate  coefficients  of 
correlation  of  the  independent  variables  with  COST  which  met  the 
Rule  1 criterion  (rQ>i.7000).  This  prompted  the  researchers  to 
arbitrarily  reduce  the  Rule  1 minimum  value  from  .7000  to 
^ .5000  for  bivariate  correlation  analysis  of  the  engine  categories 
ALL  ENGINES  and  TURBOJETS. 

Although  the  Rule  1 minimum  was  modified  downward  for  the 
engine  categories  ALL  ENGINES  and  TURBOJETS,  the  first  order 
partial  correlations  for  these  categories  supported  the  Rule  1 bivari- 
ate correlations  with  COST.  Conversely,  the  application  of  Rule  3 
criteria  to  the  first  order  partial  correlation  values  for  the  category 
TURBOFANS  did  not  support  the  bivariate  correlations  with  COST. 
The  bivariate  correlations  had  indicated  that  there  were  strong  lineai 
relationships  between  COST  and  the  independent  variables  of  this 
engine  category. 

The  res'jits  of  the  preceeding  correlation  analysis  were  con- 
sidered in  the  second  portion  of  the  cost  model  development:  variable 
selection  for  CER  development. 
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INITIAL  COST  MODEL  ANALYSIS 


As  described  in  Chapter  II,  the  multiple  linear  regression 
technique  was  utilized  to  develop  cost  estimating  relationships  for 
the  engine  categories  listed  in  Appendix  A-4.  The  CER  selected 
from  each  category  was  chosen  based  on  the  highest  coefficient  of 
determination  (R^  > .8000)  at  an  overall  model  confidence  level  of 
95  percent.  Once  the  model  was  chosen  for  each  of  the  engine  cate- 
gories, the  average  percent  predictability  for  each  model  was 
calculated  utilizing  the  test  engines  listed  in  Appendices  C-2.  1 
through  C-2.  7.  The  arbitrary  maximum  value  of  percent  predicta- 
bility a model  could  display,  and  be  considered  acceptable,  was  50.0 
percent. 

ALL  ENGINES 

The  CER  for  the  engine  category  ALL  ENGINES  was 
developed  utilizing  the  data  for  engines  listed  in  Appendix  C-2.  1 and 
the  independent  variables  indicated  in  Appendix  C-1  and  had  an 
R^  = .9003.  Appendix  C-13  lists  the  independent  variables  and  their 
coefficients  of  the  cost  model  which  met  both  the  R^  and  the  confi- 
dence level  criteria.  Even  though  the  model  chosen  was  significant 
at  the  95  percent  confidence  level,  only  one  of  the  14  independent 
variables,  Xl6  (MAXTIT),  made  a significant  marginal  contribution 
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to  the  model's  overall  explanatory  power. ^ 

Appendix  C-14  displays  the  calculation  of  the  model's  pre- 
dictive capability  using  the  test  engines.  From  Appendix  C-14  it  may 
be  observed  that  the  model  chosen  which  had  the  highest  at  a 95 
percent  confidence  level  did  not  produce  am  average  percent  predicta- 
bility within  the  acceptable  (0  to  50.0  percent)  range. 

TURBOJETS 

Application  of  the  MLR  technique  using  data  for  engines 
listed  in  Appendix  C-2.  2 did  not  produce  a CER  which  met  both  the  R^ 
and  significance  level  criteria  required  for  initial  model  selection. 
The  R^  obtained  for  a TURBOJET  model  was  .6101  (refer  to  Appendix 
C-15). 

TURBOFANS 

Appendix  C-l6  displays  the  CER  developed  for  the  engine 
category  TURBOFANS  with  an  R^  of  1. 0000.  Each  of  the  five  coef- 
ficients of  the  independent  variables  was  found  to  contribute  signifi- 
cantly to  the  model's  overall  explanatory  power. 

Appendix  C-17  displays  the  predictive  capability  of  the 
model,  which  was  calculated  by  using  data  for  the  test  engine.  The 

“^Of  the  14  variables  in  the  model  only  Xl6  appears  to  be  a 
significant  driver  of  cost,  i.  e. , XlS's  marginal  contribution  to  the 
r2  value  of  . 9003  was  the  only  contribution  to  exhibit  statistical 
significance  when  subjected  to  an  F-test, 
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overall  average  percent  predictability  of  the  model  was  calculated  to 
be  Z20.  0 percent  which  exceeded  the  50.  0 percent  maximum  estab- 
lished for  model  acceptability.  Of  the  two  test  engines  utilized,  one 
engine  (TF30-P-7)  produced  a percent  predictability  of  41.0  while 
the  other  engine  (TF39-GE-1)  produced  a percent  predictability  of 
399.  0. 

TURBOFANS  (with  Afterburner) 

Data  for  engines  shown  in  Appendix  C-2.4  were  utilized  to 
develop  the  CFR  listed  in  Appendix  C-18.  The  model  has  one  inde- 
pendent variable  and  displays  an  R^  = .9976.  The  predictive  capa- 
bility of  this  model  was  calculated  to  be  72.0  percent  (refer  to 
Appendix  C-19). 

TURBOFANS  (without  Afterburner) 

Regression  of  data  for  the  engines  listed  in  Appendix  C-2.  5 
produced  the  CER  shown  in  Appendix  C-20.  At  the  95  percent  confi- 
dence level,  this  model  displayed  an  R^  = .9999.  Additionally,  the 
coefficients  of  the  two  independent  variables  included  in  the  model 
were  found  to  contribute  significantly  to  the  model's  overall  explana- 
tory power. 

Appendix  C-21  displays  the  percent  predictability  calculated 
for  the  overall  model.  The  single  test  engine  produced  a 62.  35  per- 
cent predictability. 
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TURBOJET  (with  Afterburner) 

The  CER  listed  in  Appendix  C-22  was  developed  using  the 
data  for  engines  shown  in  Appendix  C-2.  7.  The  model  displays  an 
R^  value  of  . 9869;  however,  only  one  of  the  coefficients  of  the  four 
independent  variables  was  found  to  make  a significant  marginal  contri- 
bution to  the  model's  overall  explanatory  power.  The  overall  average 
predictive  capability  of  this  model  was  calculated  to  be  285.0  percent 
(Appendix  C-23).  However,  of  the  two  test  engines  utilized,  one 
engine  (J57-p-21)  produced  a predictability  of  25.  0 percent  and  the 
other  engine  (J85-GE-13)  produced  a predictability  of  545.  0 percent. 

Findings 

Cost  estimating  relationships  were  developed  for  each  of  the 
seven  engine  categories  using  the  MLR  procedure.  However,  of  the 
seven  models  developed,  only  five  met  the  minimum  initial  accept- 
ance criteria  of  r^^.8000  at  the  95  percent  confidence  level.  Of  the 
five  models  that  met  these  criteria,  no  model  displayed  an  overall 
average  percent  predictability  equal  to  or  less  than  the  50  percent 

maximum  set  by  the  researchers.  The  utility  of  a model  as  a pre- 

I 

dictor  of  cost  is  a subjective  determination  by  the  user  of  a model 
based  on  the  accuracy  of  a prediction  he  is  willing  to  accept.  How- 
ever, the  accuracy  and  usefulness  of  a model  as  a predictor  is 
inversely  related  to  the  absolute  value  of  the  model's  percent,  or 
average  percent,  predictability.  Appendix  C-24  provides  a summary 
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of  these  statistics. 


Although  none  of  the  cost  models  met  the  percent  predicta- 
bility criterion,  two  models  were  close.  They  were  TURBOFANS 
(with  afterburners)  and  TURBOFANS  (without  afterburners)  which 
displayed  average  overall  model  predictabilities  of  72.  0 percent  and 
62.35  percent  respectively. 

The  engine  category  TURBOFANS  displayed  a percent  pre- 
dictability of  220.0.  However,  one  of  the  two  test  engines  (TF30-P-7) 
displayed  a 41  percent  predictability  while  the  second  test  engine 
(TF39-GE-1)  displayed  a 399.0  percent  predictability. 

Of  the  three  major  engine  categories  (ALL  ENGINES, 
TURBOJETS,  and  TURBOFANS),  only  two,  ALL  ENGINES  and 
TURBOFANS,  produced  cost  models  which  met  both  the  R^  and  con- 
fidence level  criteria.  Of  the  three  TURBOJET  subcategories 
(TURBOJETS,  TURBOJETS  (with  afterburner),  and  TURBOJETS 
(without  afterburner))  only  the  category  TURBOJETS  (with  afterburner) 
produced  a CER  that  met  both  the  R^  and  confidence  level  criteria. 

All  CERs  developed  that  met  the  R^  and  confidence  level 
acceptance  criteria  were  analyzed  with  regard  to  the  makeup  of  the 
independent  variables  that  comprised  each  model;  numbers  of  engine 
operating  parameters  vs.  numbers  of  physical  engine  characteristics. 
Of  the  14  independent  variables  that  make  up  the  ALL  ENGINES  cost 
model,  10  of  14  variables  are  physical  engine  characteristics.  The 
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CER,  shown  in  Appendix  C-l6,  for  the  TURBOFANS  category  is 
comprised  of  five  independent  variables  of  which  two,  X18  (MILSPC) 
and  X27  (AFH),  are  engine  operating  parameters,  two,  X2  (CB)  and 
X4  (TB),  are  engine  physical  characteristics,  and  the  fifth  variable, 
XIO  (TW)  is  a combination  of  both  variable  categories. 

The  remaining  three  engine  CERs,  TURBOFANS  (with  after- 
burner), TURBOFANS  (without  afterburner),  and  TURBOJETS  (with- 
out afterburner)  each  had  equal  shares  of  both  operating  parameters 
and  physical  characteristic  variables.  The  exception  was  the 
TURBOFAN  (with  afterburner)  model  which  had  only  a single  inde- 
pendent variable,  X23  (EPR). 

The  indication  at  this  stage  of  the  analysis  is  that  physical 
engine  characteristics  contribute  as  well  to  a model's  overall  explan- 
atory power  as  do  engine  operating  parameters.  This  indication  is 
based  on  a simple  ratio  of  the  total  number  of  independent  variables 
in  the  five  models  discussed.  Sixty  percent  of  the  independent  vari- 
ables in  the  developed  cost  models  are  physical  engine  characteris- 
tics (Appendices  C-12,  C-14,  C-15,  C-19,  C-21). 

COST  MODEL  ANALYSIS 

The  initial  models  developed  for  the  ALL  ENGINES  and 
TURBOFANS  categories  had  R^  values  of  ,9003  and  1.0000  respec- 
tively. However,  the  R^  value  for  the  model  developed  for 
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TURBOJETS  was  only  .6101. 


1 

The  marginal  contribution  of  the  variables  to  the  model  in  ' 

the  ALL  ENGINES  and  TURBOJET  categories  were  insignificant 
except  for  the  variable  Xl6  (MAXTIT)  in  the  ALL  ENGINES  model. 

The  high  R^  values  for  the  ALL  ENGINES  and  TURBOFANS 
models,  the  low  R^  value  for  the  TURBOJETS  model,  small  sample 
sizes  for  all  categories,  and  the  insignificant  marginal  contributions 
of  the  variables  suggested  that  the  data  may  be  of  questionable  reli> 
ability,  at  least  in  the  ALL  ENGINES  and  TURBOJETS  categories.  It 
also  appeared  that  the  validity  of  the  models  developed  might  be  very 
sensitive  to  the  particular  engine  data  cases  selected  for  use  in  build- 
ing the  model  and  those  used  in  testing  the  model.  It  was  questionable 
\ whether  the  data  samples  were  representative  of  the  population.  Hence, 

i 

a more  complete  anadysis  was  undertaken  to  determine  the  validity  of 
the  data  samples  with  respect  to  the  quality  of  a model  developed  using 
{ the  data.  The  extended  analysis  was  performed  on  the  ALL  ENGINES, 

TURBOJETS  and  TURBOFAN  categories. 

Extended  Cost  Model  Analysis 

j Method  of  Analysis.  To  determine  whether  the  data  in  each  category 

I could  be  expected  to  yield  a model  with  an  R^  - . 8000,  a series  of 

multiple  linear  regression  (MLR)  models  were  developed  for  each  j 

1 

data  category.  In  each  category,  an  engine  record  was  removed 

ii 
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from  the  data  file  and  an  MLR  model  developed  using  the  remaining 

cases.  The  procedure  was  followed  until  each  engine  case  had  been 

sequentially  removed  and  a model  developed.  The  distribution  of  R^ 

values  for  the  models  in  each  category  were  studied  with  respect  to 

the  mean  (expected)  value,  the  range,  and  the  number  of  models  with 

an  R^  — .8000.  If  the  expected  R^  value  was  equal  to  or  greater  than 

. 3000,  the  range  of  R^  values  was  small,  and  the  number  of  R^  values 

less  than  .8000  was  small,  the  data  could  be  considered  representa- 

2 

tive  of  the  population.  Further,  an  R value  equal  to  or  greater  than 
. 8000  for  a model  developed  from  such  data  could  be  considered  reli> 
able.  Further,  that  when  subjected  to  an  F*test,  the  lack  of  statistical 
significance  of  a model,  with  an  R^  > .8000,  could  be  attributed  to  the 
lack  of  statistical  leverage  resulting  from  the  large  number  of  inde- 
pendent variables  in  the  model  and  the  small  data  sample  size  (10). 
Then  variables  were  chosen  to  construct  a representative  model  of 
the  data  by  comparing  variables  in  models  with  R^  values  closest  to 
the  expected  R^  value  of  each  respective  data  category.  The  number 
of  variables  chosen  would  be  equal  to  or  less  than  the  number  appear- 
ing in  the  models  compared,  i.  e. , if  there  were  17  variables  in  each 
of  the  models  compared,  then  17  would  be  chosen  for  use  in  the 
remainder  of  the  analysis.  A smaller  number  of  variables  were 
chosen  if  the  data  sample  was  not  large  enough  to  ensure  adequate 
degrees  of  freedom  in  the  denominator  when  computing  a model 
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F -value  with  all  variables  in  the  model. 

A computer  program  (MREG),  explained  in  Appendix  D,  was  j 

I 

used  to  randomly  divide  the  data  into  a percentage  for  model  develop- 
ment and  a percentage  for  testing  the  developed  model  with  respect  to 
expected  predictive  efficiency  and  its  expected  predictive  capability. 

A predictive  efficiency  of  95  percent  and  a predictive  capability  of 
from  0 to  50  percent  was  desirable.  In  order  to  attain  a representa- 
tive distribution,  MBEG  was  run  and  models  built  and  tested  1000 
times  for  each  of  the  data  categories. 

The  number  of  variables  used  was  then  reduced  so  that  the 
value  of  n-p  (refer  to  Appendix  B,  F-tests)  could  be  increased  result- 
ing in  greater  statistical  leverage  in  the  model  and  causing  the  confi- 
dence intervals  (refer  to  Appendix  B;  confidence  intervals)  for  the  i 

test  cases  to  be  narrowed. 

MBEG  was  rtin  another  1000  times  with  a reduced  number  of 
variables  for  each  data  category.  This  second  run  was  to  determine 
whether  or  not  the  model's  expected  predictive  efficiency  would 
remain  at  a level  equal  to  or  greater  than  the  level  attained  with  a 

greater  number  of  variables  and  wider  confidence  intervals.  If  the  | 

original  level  and  the  subsequent  level  were  both  greater  than  95 
percent,  it  could  be  considered  that  a model  developed  from  the  par- 
ticular data  category  using  all  the  data  would  remain  at  least  95 
percent  efficient  (10). 


I 
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The  variables  selected  for  elimination  in  the  second  itera- 


tions were  chosen  by  comparing  the  variables  in  the  same  models  | 

used  to  initially  choose  the  variables  for  further  analysis  and  by  the 

results  of  previous  correlation  analysis.  The  point  at  which  the 

value  in  each  model  exceeded  . 8000,  the  minimum  acceptable 

was  considered  a cutoff  point,  and  any  variables  which  entered  the 

model  beyond  that  point  were  eliminated. 

ALL  ENGINES.  As  shown  in  Appendices  A-4  and  C-2.  1,  the  ALL 
ENGINES  category  data  sample  contained  28  engine  cases.  Twenty- 
eight  separate  models  were  developed  utilizing  the  independent  vari- 
ables listed  in  Appendix  C-25  with  a different  engine  case  eliminated 
from  the  data  each  time.  The  R^  values  ranged  from  . 7928  to  .9185. 

The  R^  of  . 7928  was  the  only  value  below  . 8000  and  occurred  when 
the  engine  data  for  the  TF39-GE-1  engine  was  eliminated  from  the 
sample.  The  range  of  the  remaining  values  was  .8761  to  .9185;  a 
spread  of  . 0424.  The  mean  R^  value  for  the  distribution  was 
.8901. 

Five  models  whose  R^  values  were  clustered  about  the  mean 
were  used  to  choose  the  initial  variables  for  analysis.  These  vari- 
ables are  listed  in  Appendix  C-25.  MREG  was  run  1000  times  utiliz- 
ing the  variables  chosen  and,  each  time,  80  percent  of  the  data- 

sample  was  used  to  construct  the  models.  It  was  determined  from 

2 

the  models  developed  that  an  R value  of  . 9138  with  a range  of  . 3593, 


a percent  predictability  of  417.  69  percent  with  a range  of  72,  309,  and 
a predictive  efficiency  of  96.  56  percent  with  a range  of  88.  89  could 
be  expected  of  a model  developed  from  the  data  utilizing  the  variables 
referenced. 

When  the  variables  used  were  reduced  to  those  listed  in 
Appendix  C-26,  the  degrees  of  freedom  were  increased  from  four  to 
seven,  the  confidence  intervals  were  reduced  by  14  percent,  and  the 
expected  predictive  efficiency  dropped  to  94.03  percent,  a decrease 
of  2.53  percent.  Appendix  C-27  provides  comparative  statistics  of 
R^,  percent  predictability,  and  percent  efficiency  at  degrees  of 
freedom  equal  to  five  and  sixteen. 

TURBOJETS.  The  TURBOJETS  category  data  sample  is  comprised 
of  19  engine  cases  as  indicated  in  Appendix  C-2.2.  Nineteen  models 
considering  the  independent  variables  listed  in  Appendix  C-28  were 
developed  with  R^  values  ranging  from  . 7004  to  . 9996.  Seven  of  the_ 
values  ranged  from  .7004  to  .7848,  two  of  the  values  were  between 
.8000  and  .9000,  and  the  remaining  ten  ranged  from  .9071  to  .9996, 
The  mean  R^  value  was  . 8651. 

The  six  models  closest  to  the  mean  were  compared  to  choose 
the  initial  variables  in  this  category.  Because  of  the  smaller  sample 
size  n = 19,  90  percent  of  the  sample  was  used  for  model  development 
in  each  of  the  1000  models.  The  models  yielded  an  expected  R^  value 
of  .6928  with  a range  of  .8655,  an  expected  percent  predictability  of 


r ’ 


289  percent  with  a range  of  67,608,  and  an  expected  predictive 
efficiency  of  93.  32  percent  with  a range  of  from  zero  to  100  percent. 

When  the  variables  were  reduced  to  those  listed  in  Appendix 
C-29,  the  degrees  of  freedom  were  increased  from  four  to  seven,  the 
confidence  intervals  narrowed  by  15  percent,  and  the  expected  predic- 
tive efficiency  was  reduced  to  92.74  percent,  a drop  of  .58  percent. 

Appendix  C-30  provides  a comparison  of  the  R^,  percent 
prediction,  and  percent  efficiency  for  degrees  of  freedom  four  and 
seven  respectively. 

TURBOFANS.  The  nine  turbofan  engines  are  shown  in  Appendix 
C-2.  3.  The  range  of  R^  values  for  the  nine  models  developed  was 
from  .9910  to  1.0000  with  a mean  value  of  .9988. 

Since  the  range  of  R^  values  was  so  small  (.  009),  all  of  the 
models  were  compared  to  choose  the  initial  variables  that  are  listed 
in  Appendix  C-31. 

Again,  because  of  the  small  sample  size,  90  percent  of  the 
sample  was  used  to  build  each  of  the  1000  models.  It  was  determined 
that  an  R^  value  of  .9986  with  a range  of  .0027,  a percent  predicta- 
bility of  54.25  percent  with  a range  of  127,  and  a predictive  efficiency 
of  87.  70  percent  with  a range  of  from  zero  to  100  percent  could  be 
expected  of  a model  developed  from  the  variables  and  daLa  sample 

used.  1 
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The  number  of  variables  were  also  reduced  (Appendix  C-32) 
in  this  category  increasing  the  degrees  of  freedom  from  one  to  four 
and  reducing  the  confidence  intervals  by  75  percent  resulting  in  a 
diminished  predictive  efficiency  of  86.82  percent,  a reduction  of  .88 
percent.  Appendix  C-33  provides  a comparison  of  R^,  percent  pre- 
dictability, and  percent  efficiency  for  degrees  of  freedom  one  and 
four. 

Final  Models.  An  MLR  model  was  then  developed  for  each  category 
using  the  representative  variables  initially  chosen  for  further  analysis, 
(Appendices  C-25,  C-26,  and  C-31),  To  develop  these  models,  the 
SPSS  forward  inclusion  technique  was  used  and  all  of  the  engine  data 
cases  were  used  for  each  respective  category.  The  model  for  each 
category  is  listed  in  Appendices  C-34,  C-35,  auid  C-36. 

Findings 

Many  points  in  the  data  analysis  indicate  that  the  turbofan 

2 

data,  when  combined  with  the  turbojet  data,  caused  the  high  R for  the 
ALL  ENGINES  category. 

The  R^  value  was  .9003  for  ALL  ENGINES,  1.  0000  for 
TURBOFANS,  and  .6100  for  TURBOJETS.  Only  one  of  the  variables 
in  the  ALL  ENGINES  model  made  a significant  marginal  contribution 
while  adl  of  the  variables  in  the  TURBOFANS  model  made  extremely 
significant  contributions.  None  of  the  variables  in  the  TURBOJETS 


52 


category  made  a significant  contribution. 

The  distribution  of  values  for  ALL  ENGINES  had  one 
value  less  than  . 8000,  amd  the  range  of  the  remaining  values  was  only 
. 04.  The  values  for  TURBOFANS  were  tightly  clustered  with  high  R^ 
values  covering  a range  of  only  .0090.  However,  the  R^  values  for 
TURBOJETS  were  not  distributed  as  well;  the  range  was  .2992  with 
seven  values  below  .8000,  two  values  between  .8000  and  .9000,  and 
the  remaining  10  values  were  greater  than  .9000. 

Further  analysis  of  the  1000  models  developed  in  each  cate- 

2 

gory  supported  the  observation.  The  mean  R value  was  .9138  for 
ALL  ENGINES,  . 6928  for  TURBOJETS,  and  .9986  for  TURBOFANS. 
The  percent  predictability  was  417.69  with  a range  from  14.  99  to 
72324  for  ALL  ENGINES,  288.  99  with  a range  of  2.  18  to  67,  608.  20 
for  TURBOJETS,  and  54.25  with  a range  of  .93  to  127.  66  for 
TURBO  FANS. 

One  reason  for  the  TURBOFAN  data’s  influence  on  the  expir- 
atory power  (R^  value)  of  the  ALL  ENGINES  model  could  be  due  to 
more  accurate  cost  figures  in  the  data  sample.  Another  reason  might 
be  that  the  turbofan  data  is  more  representative  of  its  population  cr  a 
combination  of  both  reasons  may  apply. 

A comparison  of  the  distributions  of  R values  for  each 
category  suggests  that  the  data  in  the  TURBOJETS  category  is  not 
stable  throughout  the  data  sample  while  the  data  in  the  TURBOFANS 
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appears  to  be  stable.  Since  the  ALL  ENGINES  category  is  comprised 
of  the  turbojet  and  turbofan  data,  the  relationships  of  turbofan  data 
•with  COST  may  be  masking  and  assisting  a weak  relationship  between 
turbojet  data  and  COST. 

The  final  chapter  will  present  the  summary,  conclusions  and 
recommendations  resulting  from  the  research.  In  addition,  recom- 
mendations for  extended  research  are  provided. 


CHAPTER  rV 


SUMMARY,  CONCLUSIONS,  AND  RECOMMENDATIONS 
FOR  FUTURE  STUDY 

SUMMARY 

The  objective  of  this  research  effort  was  to  develop  depot 
level  cost  estimating  relationships  (CERs)  for  USAF  jet  core-engines 
utilizing  both  engine  operating  parameters  and  physical  engine  charac- 
teristics. Present  CERs  published  by  the  Air  Force  Logistics  Com- 
mand's (AFLC)  Independent  Cost  Analysis  Division  (ICAD)  utilize  pri- 
marily engine  operating  parameters  as  variables  to  predict  overhaul 

cost  (18). 

The  study's  scope  was  limited  to  the  development  of  CERs  . 
for  jet  core-engines.  This  limitation  is  in  contrast  to  present  AFLC 
CERs  which  attempt  to  predict  total  jet  engine  overhaul  cost  (i.  e., 
the  core-engine  plus  engine  mounted  accessories).  Since  engine 
accessory  overhaul  may  be  performed  independent  of  core-engine 
overhaul,  only  CERs  for  jet  core- engine  depot  level  overhaul  were 
investigated. 

The  engines  utilized  by  the  AFLC  to  develop  CERs  published 

in  AFLC  Pamphlet  173-4  were  selected  as  the  data  base  engines  for 
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this  research  effort.  Additionally,  engine  physical  characteristics 
were  considered  as  possible  overhaul  cost  drivers. 


Seven  engine  categories  were  initially  selected  for  CER 
development.  They  were: 

ALL  ENGINES 

TURBOJETS 

TURBOFANS 

TURBOFANS  (with  afterburner) 

TURBOFANS  (without  afterburner) 

TURBOJETS  (with  afterburner) 

TURBOJETS  (without  afterburner) 

Of  the  original  seven  engine  categories  investigated,  only  three  cate- 
gories (ALL  ENGINES,  TURBOJETS,  and  TURBOFANS)  were 
selected  for  final  model  development.  Final  models  were  developed 
by  utilizing  the  technique  of  multiple  linear  regression.  The 
researchers  established  model  acceptance  criteria  of  R^>  .8000,  at 
a 95  percent  confidence  level,  with  a predictive  efficiency  of  not 
greater  than  50  percent,  and  an  overall  model  efficiency  not  lower 
than  95  percent.  Eighty  percent  of  the  ALL  ENGINES  data  base 
records  were  randomly  selected  for  model  development,  and  90  per- 
cent of  the  TURBOJETS  and  TURBOFANS  data  base  records  were 
randomly  selected.  The  values  80  and  90  percent  were  decided  upon 
by  the  researchers  after  considering  the  sample  sizes,  n,  in  each 

56 


category  (n  = 28  for  ALL  ENGINES,  n = 19  for  TURBOJETS  and 


n = 9 for  TURBOFANS).  Models  were  developed  in  each  category 
using  the  randomly  selected  data  bases  and  the  model  acceptance 
criteria  was  applied  to  them.  Final  models  for  the  engine  categories 
ALL  ENGINES,  TURBOJETS,  and  TURBOFANS  were  developed 
utilizing  100  percent  of  the  data  records  in  each  category. 

This  research  met  the  objective  which  was  to  develop  cost 
estimating  relationships  that  utilized  both  engine  operating  parameters 
and  physical  engine  characteristics.  The  final  CER  developed  for  the 
engine  category  ALL  ENGINES  (Appendix  C-34)  utilized  16  independent 
variables  of  which  seven  are  operating  parameters,  e.  g.,  Xl6 
(MAXTIT)  and  X17  (NORMSFC),  and  nine  of  which  are  physical  char- 
acteristics, e.g. , XI  (CR)  and  X6  (L),  The  final  model  developed 
for  the  engine  category  TURBOJETS  (Appendix  C-35)  utilized  14 
variables  of  which  seven  were  operating  parameters  and  seven  were 
physical  characteristics.  The  TURBOFANS  cost  model  (Appendix  ' 
C-36)  contained  six  variables  of  which  two  were  operating  parameters 
and  four  were  physical  engine  characteristics. 

Although  the  research  supported  the  concept  of  utilizing  both 
engine  operating  parameters  and  physical  characteristics,  only  two 
(ALL  ENGINES  and  TURBOFANS)  of  the  final  three  models  developed 
met  the  minimum  R^  criterion  (.  8000)  at  the  95  percent  confidence 
level.  However,  neither  of  these  models  satisfied  the  remaining  two 
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criteria  for  model  validation,  i,  e.  , a predictive  efficiency  of  not  less 
than  95  percent  and  a percent  predictability  not  less  than  50  percent. 

CONCLUSIONS 

In  this  section  the  conclusions  drawn  from  the  research 
effort  are  presented.  It  must  be  realized  that  some  of  the  conclusions 
discussed  are  only  applicable  within  the  scope  of  this  study.  On  the 
other  hand,  some  may  have  general  applicability. 

Initial  models  developed  for  the  ALL  ENGINE  and  the 
TURBOFAN  category  had  R^  values  of  .9003  and  1.0000,  respectively. 

However,  the  model  developed  for  the  TURBOJET  category  had  an 

2 2 
R of  only  .6101,  a value  less  than  the  minimum  acceptable  R value 

of  . 8000, 

Upon  analyzing  the  models  developed  for  the  categories  of 
ALL  ENGINES  and  TURBOJETS  it  was  discovered  that  only  one,  XI6 
(MAXTIT),  of  the  14  variables  in  the  ALL  ENGINE  model  (.Appendix 
C-13)  made  a significant  marginal  contribution  to  the  model's  R^ 
value.  None  of  the  13  variables  in  the  TURBOJETS  model  (Appendix 
C-15)  made  a significant  marginal  contribution  to  the  model's  R^ 
value. 

The  low  R^  value  for  the  TURBOJETS  model  and  the  fact  that 
only  XI6  in  the  ALL  ENGINES  model  made  a significant  marginal 
contribution  to  either  of  the  models'  R^  values  indicated  a potential 
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problem  area.  It  was  surmised  that  the  COST  variable  data  and/or 
the  independent  variable  data  for  the  TURBOJET  and/or  the  ALL 
ENGINES  category  was  of  questionable  reliability.  Further,  it  was 


felt  that  the  value  for  a model  developed  using  data  from  either  of 
the  two  categories  would  depend  on  which  segment  of  the  data  was 
used  for  model  development* 

Further  analysis  was  performed  to  determine  if  what  was 

surmised  was  true.  One  engine  data  record  at  a time  was  held  out  of 

each  data  category  and  then  replaced  after  a model  was  developed. 

This  procedure  was  followed  until  all  engines  in  each  category  had 

been  held  out.  Thus,  28  models  were  developed  for  ALL  ENGINES, 

19  for  TURBOJETS  and  9 for  TURBOFANS.  The  distribution  of  R^ 

values  for  the  TURBOJETS  models  yielded  am  expected  R^  value  of 

.8651.  However,  the  values  ranged  from  .7004  to  . 9996,  a spread 

of  .2992,  with  seven  of  the  nineteen  values  in  the  .7000  to  .8000 

range,  two  values  between  .8000  and  .9000  and  ten  values  above 

.9000.  No  two  R^  values  were  the  same.  The  distribution  for  this 

2 

category  indicated  that  the  data  was  unstable  and  that  the  R value 
was  dependent  on  the  particular  engine  data  records  used  for  model 
construction.  The  distribution  of  the  nine  values  for  TURBOFANS 
models  revealed  an  expected  R^  value  of  .9988  with  values  ranging 
from  .9910  to  1.  0000,  a spread  of  only  .009.  The  turbofan  data 
consistently  provided  a model  R^  value  greater  than  .9900  irrespective 
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of  the  data  records  used  for  model  construction.  The  distribution  for 


ALL  ENGINES  was  better  than  the  distribution  for  TURBOJETS  but 
worse  than  the  distribution  for  TURBOFANS.  The  expected  R^  value 
for  ALL  ENGINES  was  .8901  and  the  values  ranged  from  .7928  to 
.9185,  a spread  of  . 1257  indicating  that  the  R^  value  was  dependent 
on  the  engine  data  records  used. 

An  interim  conclusion  was  made  that  the  turbojet  data  was 
unstable,  not  representative  of  the  population,  and  could  not  produce 
a reliable  model.  Further,  the  turbofcui  data  appeared  very  stable 
and  consistent  and  when  the  turbofan  and  turbojet  data  were  combined 
in  the  ALL  ENGINES  category,  the  turbofah  data  greatly  influenced 
the  r2  values  for  models  developed. 

An  extended  analysis  was  performed  on  each  category  utilizing 
the  computer  program  MREG  (Appendix  D).  One  thousand  models 
were  developed  for  each  category  and  zin  average  (expected)  value 
determined,  in  each  category,  for  R^  percent  predictability,  and  pre- 
dictive efficiency.  The  TURBOJETS  category  exhibited  an  R^  value  of 
.6928,  with  a range  of  . 8655,  a percent  predictability  of  289  percent 
with  a range  of  67,608,  and  a predictive  efficiency  of  93.32  percent 
with  a range  of  0 to  100  percent.  The  TURBOFANS  category  dis- 
played an  R^  value  of  . 9986  with  a range  of  .0027,  a percent  predict- 
ability of  54.  25  percent  with  a range  of  127  and  a predictive  efficiency 
of  87.70  percent  with  a range  of  0 to  100  percent.  The  ALL  ENGINES 
category  had  an  R^  value  of  .9138  with  a range  of  . 3593,  a percent 
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predictability  of  418  percent  ■with  a range  of  72,  309,  and  a predictive 
efficiency  of  96.  56  percent  with  a range  of  11.  11  to  100  percent. 

The  extended  analysis  supported  the  interim  conclusion. 

The  turbojet  data  is  faulty  and  cannot  be  expected  to  produce  a useful 
and/or  reliable  model.  Further,  the  effect  of  the  turbojet  data  in  the 
ALL  ENGINES  category  has  rendered  the  data  base  in  that  category 
unreliable  for  the  development  of  valid  models. 

The  TURBOFAN  category  appears  to  have  potential  for  reli- 
able model  development.  However,  the  very  small  sample  size,  nine 
data  points,  as  shown  in  Appendix  C-2.  1,  appears  to  have  restricted 
that  capability. 

A high  R value  can  be  attained  by  introducing  a large  number 
of  variables  into  the  model  each  with  insignificant  marginal  contribu- 
tion. However,  the  model  may  have  virtually  no  predictive  capability. 
This  was  evidenced  by  the  14  variables  in  the  initial  ALL  ENGINES 
model  (Appendix  C-13)  cmd  the  13  variables  in  the  initial  TURBOJETS 
model  (Appendix  C-15),  Therefore,  a high  R^  value  does  not  imply 
that  the  model's  predictive  capability  is  good. 

The  most  significant  discovery  in  this  research  is  that 
models  that  are  statistically  significant  with  high  R^  values  indicating 
high  explanatory  power,  i.  e. , ability  to  explain  variation  in  the  data, 
can  be  developed  from  questionable  and  possibly  ur.reliable  data. 

The  models  appear  to  be  acceptable  if  taken  at  face  value.  Further 
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analysis,  however,  may  indicate  they  are  impractical  and  of  question- 
able utility  as  cost  estimators.  Therefore,  a cost  estimating  model 
cannot  be  accepted  on  the  basis  of  value  alone.  One  must  analyze 
the  data  base  for  reliability  and  the  model  for  predictive  capability. 

As  indicated  in  Appendices  C-34,  C-35,  and  C-36  physical 
characteristics  constitute  at  least  50  percent  of  the  variables  in  the 
final  model  constructed  for  each  engine  category.  In  the  ALL 
ENGINES  category,  seven  of  the  16  independent  variables  were  phys- 
ical characteristics,  for  TURBOJETS  seven  of  14  were  physical 
characteristics,  and  for  TURBOFANS  four  of  six  were  physical 
characteristics.  The  high  percentage  of  physical  characteristic  vari- 
ables in  each  model  indicates  the  potential  for  the  development  of 
accurate  and  usable  cost  estimating  models  with  engine  physical 
characteristics  as  major  cost  drivers.  In  addition,  the  technique 
used  in  this  study  of  utilizing  a percentage  of  the  data  base  for  model 
development  and  reserving  the  remainder  of  the  base  for  model  test- 
ing permits  discovery  of  potential  problems  that  would  not  normally 
be  detected.  However,  the  data  base  must  not  be  diluted  to  the  point 
where  small  degrees  of  freedom  limit  its  usefulness. 

RECOMMENDATIONS  FOR  FURTHER  STUDY 

As  a result  of  this  research  there  were  several  areas  ident- 
ified where  extended  analysis  may  provide  greater  insight  in  the 
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development  of  cost  estimating  relationships.  Areas  for  further  i 

cost  estimating  study  are: 

First,  cost  estimating  relationships  for  predicting  depot 
level  overhaul  costs  for  jet  engine  accessories  should  be  investigated. 

A study  in  this  area  would  be  complimentary  to  the  research  conducted 
by  this  study. 

Second,  a jet  engine  is  considered  to  have  three  major 
sections  (a  compressor  section,  a combustor  section,  and  a turbine 

section).  A study  should  be  undertaken  to  investigate  the  predictive  ' 

capability  of  cost  estimating  relationships  developed  to  estimate  depot 
level  overhaul  costs  relative  to  each  of  these  three  engine  sections. 

Third,  this  study  utilized,  as  the  dependent  variable, 
cost  expressed  on  a dollar  per  flight  hour  basis.  Further  research 
should  investigate  cost  expressed  in  terms  of  dollars  per  end  item 
overhauled. 

Finally,  depot  level  overhaul  cost  estimating  relation- 
ships developed  for  jet-core  engines  or  by  engine  sections,  should  be 
combined  with  cost  estimating  relationships  developed  for  depot  level 

overhaul  of  engine  accessories.  The  predictive  capability  of  this  j 

i 

combination  should  then  be  compared  to  the  predictive  capability  of  | 

total  engine  (core-engine  plus  engine  accessory)  depot  level  overhaul 
cost  estimating  relationships  presently  in  use  by  the  Air  Force 
Logistics  Command. 

J 
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The  area  of  cost  estimating  within  the  Department  of  Defense 


is  wide  open  and  many  important  discoveries  are  yet  to  be  made. 
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APPENDIX  A 

DATA  BASE  JET  ENGINES  AND  VARIABLES 
AND  ENGINE  CATEGORIES  INVESTIGATED 
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APPENDIX  A-1 
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Independent  Variables  Considered  by  Present 
AFLC  Jet  Engine  Overhaul  CERs  (23:23) 

Engine  Dry  Weight 
Turbine  Inlet  Temperature 
Normal  Thrust 
Maximum  Thrust 

Specific  Fuel  Consumption  (at  normal  thrust) 

Specific  Fuel  Consumption  (at  maximum  thrust) 

Thrust  to  Weight  Ratio  (at  normal  thrust) 

Thrust  to  Weight  Ratio  (at  maximum  thrust) 

Engine  Age 

Average  Flight  Time  Between  Overhaul 
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APPENDIX  A- 2 

List  of  Independent  Variables 


Name 

Number 

Abbreviation 

Number  of  Compressor  Rotors 

XI 

CR 

Number  of  Compressor  Blades 

X2 

CB 

Number  of  Turbine  Rotors 

X3 

TR 

Number  of  Turbine  Blades 

X4 

TB 

Number  of  Burner  Cans 

X5 

BC 

Engine  Length 

X6 

L 

Engine  Diameter 

' 

X7 

D 

• 

Length  to  Diameter  Ratio 

X8 

LD 

Engine  Dry  Weight 

X9 

W 

Thrust  to  Weight  Ratio 

XIO 

TW 

Rated  Thrust  at  Normal  Power 

XU 

NORMT 

f 

1 Rated  Thrust  at  Military  Power 

X12 

MILT 

' Rated  Thrust  at  Maximum  Power 

[ 

X13 

MAXT 

[ Rated  Turbine  Inlet  Temperature  at 

1 Normal  Power 

X14 

NORMTIT 

1 

Rated  Turbine  Inlet  Temperature  at 
Military  Power 

j 

X15 

MILTIT 

! Rated  Turbine  Inlet  Temperature  at 

i Maximum  Power 

X16 

IvIAXTIT 
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Name 


Number  Abbreviation 


r 


Normal  Specific  Fuel  Consumption 

X17 

NORMSFC 

Military  Specific  Fuel  Consumption 

X18 

MILSFC 

Maximum  Specific  Fuel  Consumption 

X19 

MAXSFC 

Cruise  RPM 

X20 

CRPM 

Maximum  RPM 

X21 

MRPM 

Fan  By-pass  Ratio 

X22 

FBR 

Engine  Pressure  Ratio 

X23 

EPR 

Maximum  Rated  Airflow 

X24 

AIR 

Average  Flight  Hours  Between  Overhaul 

X25 

AFHBO 

Total  Flight  Hours  (FY  1974,  1975,  1976) 

X26 

FH 

Average  Flight  Hours  Per  Year 

X27 

AFH 

i 

I 
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APPENDIX  A -3 


Thesis  Data.  Population 


J33-A-35 

J60-P-5A 

TF30-P-3 

J57-P-13 

J65-W-5 

TF30-P-7 

J57-P-19 

J69-T-25 

TF30-P-9 

J57-P-21 

J75-P-17 

TF30-P-100 

J57-P-23 

J75-P-19/19W 

TF33-P-3 

J57-P-29 

J79-GE-15 

TF33.P-5 

J57-P-43 

J79-GE-17 

TF33-P-7 

J57-P-55 

J85-GE-5 

TF33-P-9 

J57-P-59 

J85-GE-13 

TF39-GE-1 

APPENDIX  A -4 


Initial  Engine  Categories  Investigated 

Number  of  Engines 

Category  Per  Category 

100%  of  Sample  80%  of  Sample 

1.  ALL  ENGINES  28  22 

2.  TURBOJETS  19  15 

3.  TURBO  FANS  9 7 

4.  TURBOFANS  (with  afterburner)  4 3 

5.  TURBOFANS  (without  after- 

burner) 5 4 

6.  TURBOJETS  (with  afterburner)  10  8 

7.  TURBOJETS  (without  afterburner)  9 7 
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appendix  b 

REGRESSION  ANALYSIS* 

SIMPLE  LINEAR  REGRESSION 

Simple  linear  regression  (SLR)  involves  regressing  a depend- 
ent variable  (usually  plotted  on  the  Y axis)  on  an  independent  variable 
(usually  plotted  on  the  X axis).  The  SLR  technique  is  used  to  develop  a 
linear  regression  equation  which  fits  a line  to  the  sample  or  population 
data  points  such  that  the  line  can  be  used  as  an  estimate  of  Y for  a given 

A 

value  of  X.  The  regression  line  equation  is  of  the  form  Y = a + bX 
where  Y is  an  estimate  of  the  true  Y value.  Since  the  regression  line 
will  usually  not  fit  all  of  the  data  points  exactly,  there  will  be  a differ- 
ence between  the  Y and  the  true  value  of  Y for  some  values  of  X.  That 
difference  is  called  the  error  term  or  the  residual  term  e-.  Thus,  the 
population  regression  model  equation  is  expressed  as  Y = A + BXj^  + e^. 

Assumptions 

The  assumptions  that  are  made  when  working  with  SLR 
analysis  are; 

*A  working  knowledge  of  regression  analysis  on  the  part  of 
the  reader  is  assumed.  For  a more  in-depth  presentation  of  regression 
analysis,  the  reader  may  wish  to  consult  Applied  Linear  Statistical 
Models  by  Neter  and  Wasserman  (14). 


{i 

‘1 

1 


72 


1.  The  error  terms  (e^)  are  independent  for  all  values  of  X, 

2.  The  expected  value  of  the  error  terms  is  zero. 

3.  The  variance  of  the  error  terms  is  constant  for  all  values 


of  X. 

4.  The  error  terms  for  all  values  of  X are  distributed 
normally  with  a mean  of  zero  and  a variance  of 

LEAST  SQUARES  IvfETHOD 

For  a regression  model  to  fit  the  data  as  well  as  possible  the 

effect  of  the  error  terms  must  be  minimized.  Since  the  (cj^'s)  may  be 

positive  or  negative,  their  sum  may  equal  zero  while  large  ej^  values 

may  still  exist.  The  absolute  value  function  of  the  error  terms  is  not 

differentiable  over  the  entire  range  of  the  function  thus  a minimum  value 

cannot  be  determined.  The  Least  Squares  Method  (LSM)  attempts  to 

2 

minimize  the  sum  of  the  square  of  the  error  terms  (I  e^  ).  Partial 
derivacives  of  the  sum  are  taken  with  respect  to  A and  B and  set  equal" 
to  zero.  Solving  the  system  of  normal  equations  that  results  yields 
equations  for  the  estimation  of  A and  B via  sample  estimators,  a and  b: 

A = a = ^^jyi  - ^i^ 

Ixi 

B s b = ^^j/i  -^^jyj 
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Once  a and  b are  computed,  the  regression  equation  Y(est)  = a + bx  is 
determined  for  a given  value  of  x. 


Since  multiple  linear  regression  (MLR)  is  the  technique  used 
in  this  thesis,  SLR  will  not  be  discussed  further.  All  statistical  tests 
and  acceptance  criteria  will  be  explained  using  MLR  terminology  and 
notation. 

MULTIPLE  LINEAR  REGRESSION 

Multiple  Linear  Regression  enables  regression  of  Y on  two  or 
more  independent  variables  (x^'s).  The  general  MLR  population  model 
is: 

Y = Bg+BiXii+B2Xi2  + ...  +Bp_iXi,p-l  + ei  i = 1,  2,  . . . , n 

With  MLR,  p regression  coefficients  (B's)  are  estimated  using  matrix 
operations  and  thus  the  model  can  be  written: 

Y = XB  + e 


where 


B = 


e = 


(pxl) 


n s number  of  sample  observations 


The  Least  Sqviares  Method  (LSM)  is  again  used  to  estimate 


the  B's  and: 


B (est)  s b s 


= (X'^X)"^  (X'^Y) 


(pxl) 


Assvimptions 


By  the  Gauss-Markov  Theorem,  LSM  estimators  are  the  best 
linear  unbiased  estimators  of  the  population  regression  coefficients. 
However,  the  following  assumptions  must  be  valid  for  the  least  squares 
method  of  MLR  to  be  applicable  (5:528-530;  14:38): 

1.  The  four  assumptions  listed  for  SLR. 

2.  A linear  model  is  appropriate. 

3.  The  number  of  sample  observations,  n,  exceeds  the 
number  of  parameters  to  be  estimated,  p,  L.  e. , n>p. 

4.  The  rank  of  the  X matrix  is  p. 

5.  The  Xi's  are  observed  without  error. 
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Decompoaition  of  Variation 

If  the  asaumption  that  a linear  model  la  appropriate  la  not 
made,  i.  e. , no  regreaaion  of  Y on  the  Xi'a,  then  the  beat  point  eatl- 
mator  of  Y is  its  mean  (Y)  and  there  will  be  variation  of  the  actual  Y 
values  about  Y.  When  Y is  regressed  on  the  Xi's,  an  attempt  is  made 
to  explain  as  much  of  the  variation  as  possible  by  the  regression  model. 

The  total  variation  (TV)  or  sum  of  squares  total  (SST)  may 
be  decomposed  into  explained  variation  (EV)  or  regression  sum  of 
squares  (SSB.)  and  unexplained  variation  (UV)  or  error  sum  of  squares 
(SSE)  such  that: 

TV  = EV  + UV 
OR 

SST  = SSR  + SSE 


! SSR  is  the  component  of  total  variation  explained  by  the  regression 

model  and  SSE  is  the  component  of  total  variation  unexplained  by  the 
regression  model.  The  regression  mean  square  (MSR)  is  SSR  divided 

[ 

by  p-1  and  the  error  mean  square  (MSE)  is  SSE  divided  by  n-p.  These 
statistical  parameters  are  discussed  later  in  the  text.  Table  1 lists 

i 

I computational  equations  for  the  respective  components  of  variation. 


1 
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Table  I 

Computational  Equations  for  the  Components 
of  Variation 


Source 

Sum  of  Squares 

df 

Mean  Squares 

Explained  by 
regression 

SSR  a b'^(x’^Y)  - nY^ 

P-1 

MSR  a 

P-1 

Unexplained  by 
regression 

SSE  a y'^Y  - b'^(X'^Y) 

MSE  a 

n-p 

Total 

SST  a y'^Y  - nY^ 

The  Coefficient  of  Determination  (R^) 

The  greater  the  portion  of  total  variation  explained  by  the 
regression  model,  the  better  the  model  fits  the  data. 

The  coefficient  of  multiple  determination  (H^)  is  used  as  a 
measure  of  how  well  the  model  fits  the  data.  is  defined  as: 

r2  . SSR 
" SST 

2 

The  closer  R is  to  one,  the  greater  the  amount  of  variation  explained 
by  the  regression  model. 

F Tests 

To  test  the  statistical  significance  of  the  overall  regression 
model  an  F ratio  is  computed  from  sample  data  and  compared  to  a 
critical  F value  in  an  attempt  to  reduce  or  resolve  the  hypotheses  test: 

Hq:  AU  = 0 


1 


i 


A 
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At  least  one  B£j40i=l,...,p-1 

The  sample  F ratio: 

V - MSR 
■ MSE 

yields  an  which  is  compared  to  p-l,n.p* 

Fcrit  is  obtained  from  a standard  F-table  for  a level  of  significance 
l-ot  and  for  degrees  of  freedom  p-l,n™p.  If  F<  cannot  be 

rejected  and  there  is  insufficient  evidence  to  conclude  Hj:  At  least  one 
Bj  ^ 0 i = 1,  . . . , p-1.  If  F>Fj.j.j^^,  can  be  rejected  and  the  conclusion 
made  that  the  over<dl  regression  model  is  statistically  significant  at 
the  chosen  level. 

To  test  the  statistical  significance  of  the  marginal  contribu- 
tion of  the  X^'s,  the  estimated  variance -covariance  matrix  of  the  b's 
must  be  computed: 

s2(b)  = MSE(X'^X)-1 

s(bj^)  is  obtained  from  the  s (b)  matrix  and  a t-statistic  is  computed  for 
the  regression  coefficient  of  the  respective  X^: 


The  t-statistic  is  compared  to  a critical  t value,  a/2’n-p  obtained 

from  a standard  t-table  for  a level  of  significance,  a and  associated 
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degrees  of  freedom  n-p. 


When  testing  the  simultaneously,  each  test  must  be  con- 
ducted at  an  equivalent  a level  so  that  the  overadl  system  of  tests  main- 
tains a confidence  of  (1-a)  x 100%.  A conservative  approach  to  simul-  j 

taneous  testing  which  insures  at  least  a (1-a)  x 100%  level  of  confidence 
is  the  Bonferroni  technique.  This  technique  yields  an  equivalent  a(a'); 

a'=iL 
P-1 

The  con^arison  of  t-statisdcs  is  used  to  reduce  or  resolve  the 
hypothesis  test: 

Ho:  Bi  = 0 
Hj:  0 

If  It!  > reject  and  conclude  that  the  marginal  contribution  of 

Xi  is  statistically  significant  at  the  chosen  a level.  If  ltl<  Hq 

cannot  be  rejected  and  there  is  insufficient  evidence  to  conclude  that  the 
marginal  contribution  is  significant  (14:146-149). 

Confidence  Intervals 

Since  regression  models  provide  only  an  estimate  of  the 
actual  value  of  the  dependent  variable  Y,  most  of  the  time  the  actual 
value  of  Y will  not  coincide  with  the  estimated  value,  i.  e.  , the  actual 
value  will  not  normally  be  on  the  regression  surface. 
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To  determine  the  confidence  that  may  be  placed  in  a regres- 
sion model  as  a predictor,  confidence  intervals  can  be  constructed  about 


the  estimates  rendered  by  the  model. 

A confidence  interval  (Cl)  is  constructed  about  the  estimates 
of  a regression  model  at  a chosen  level  of  significance,  a.  A decision 
maker  can  then  be  (1-a)  x 100%  confident  that  the  actual  value  of  Y will 
be  no  farther  than  one  half  of  the  Cl  away  from  the  estimate  for  Y. 


Standard  Error  of  the  Estimate 

The  standard  error  of  the  estimate  is  a measure  of  disper- 
sion of  the  error  terms  (ej^'s)  about  the  locus  of  the  regression  model 
estimates,  i.  e.,  the  regression  surface,  and  is  computed; 


where: 

n = the  number  of  cases  in  the  sample 

p = the  number  of  x.'s  in  the  model  plus  one 

UV  = the  amount  of  variation  unexplained  by  the  model. 
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APPENDIX  C 

SUPPORTING  DATA  AND  FORMULATED 
COST  ESTIMATING  RELATIONSHIPS 
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APPENDIX  C-1 


Independent  Variables  Considered  in  the  Development 
of  CERs  by  Engine  Category 
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(W/O  AB) 


APPENDIX  C-2.  I 


ALL  ENGINES 

Eagines  Considered  in  Correlation  Analysis 
and  Model  Development 


J33-A-35 

J57-P-59 

J85-GE-13 

J57-P-13 

J60-P-5A 

TF30-P-3 

J57-P-19 

J65-W-5 

TF30-P-7 

J57-P.21 

J69-T-25 

TF30-P-9 

J57-P-29 

J79-GE.15 

TF30-P-100 

J57-P-43 

J79-GE-17 

TF30-P-3 

J57.P.55 

J85-GE-5 

TF33-P-5 

TF33-P-7 

Engines  Set  Aside  for  Model  Validation 

J57-P-23 

J75-P-17 

TF33-P-9 

J60-P-3 

J75-P-19 

TF39-GE-1 

k J 


APPENDIX  C-2.2 

TURBOJETS 

Engines  Considered  in  Model  Development 


J33-A-35 

J57-P-55 

J75.P.19 

J57-P-13 

J57-P-59 

J79-GE.15 

J57-P-21 

J60-P-5A 

J79-GE-17 

J57-P-23 

J65-W-5 

J85-GE-5 

J57-P-29 

J69-T-25 

J85-GE.13 

J57-P-43 

J75-P-17 

Engines  Utilized  for  Model  Testing 


APPENDIX  C-2.3 


TURBOFANS 

Engines  Used  for  Model  Development 

TF30-P-3  TF30-P-100  TF33.P-5 

TF30-P-9  TF33-P-3  TF33-P-7 

TF33-P.9 

Engines  Used  for  Model  Testing 
TF30-P-7  TF39-GE-1 


85 


APPENDIX  C-2.4 

TURBOFANS  (with  afterburner) 

Engines  Used  for  Model  Development 

TF30-P-3  TF30-P-7  TF30-P-9 

Engine  Used  for  Model  Testing 

TF30-P-100 


APPENDIX  C-2.5 


TURBOFANS  (without  afterburner) 

Engines  Used  for  Model  Development 

TF33-P-3  TF33-P-7  TF33-P-9 

TF33-P-5 

Engine  Used  for  Model  Testing 
TF39-GE-1 


r 


J57-P-13 

J57-P-23 

J57-P-55 


J57-P-21 


APPENDIX  C-2.6 


turbojets  (with  afterburner) 
Engines  Used  for  Model  Development 


J75-P-17 

J75-P-19 

J79-GE-15 


J79*GE-17 

J85-GE-5 


Engines  Used  for  Model  Testing 


J85-GE-13 


1 


APPENDIX  C.2.7 


TURBOJETS  (without  afterburner) 

Engines  Used  for  Model  Development 

J33-A-35  J57-P-59  JbO-P-SA 

T57-P-29  J60-P-3  JbS-W-S 

J57-P-43 

Engines  Used  for  Model  Testing 
J57-P-19  J69-T-25 
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APPENDIX  C-4 


ALL  ENGINES 
n * 22 

Bivariate  coefficient  of  correlation  between  independent  variables 
which  met  the  modified  Rule  1 criterion  > . 5000)  with  COST. 


APPENDIX  C-5 


ALL  ENGINES 
n s 66 

First  Order  Coefficients  of  Partial  Correlation 

COST  with 


Controlling 


for 

X6 

X8 

X16 

X23 

X25 

XI  (CR) 

.6130 

.4153 

.4329 

.4681 

-.5686 

X2  (CB) 

.6525 

.4360 

.4768 

.5199 

-.5863 

X3  (TR) 

.6101 

.4865 

.4242 

.6557 

-.5688 

X4  (TB) 

.6789 

. 5330 

. 5238 

.7058 

-.6116 

X5  (BC) 

.6667 

. 5294 

. 5568 

. 5636 

-.6316 

X6  (L) 

— 

.0956 

. 1990 

. 1227 

..  3680 

X7  (D) 

.6716 

. 5899 

. 5191 

.6226 

-.6157 

X8  (LD) 

. 5016 

.4125 

.5155 

-.4709 

X9  (W) 

.6854 

.4712 

. 5003 

.5091 

-.6005 

XIO  (TW) 

.6907 

.4898 

. 5157 

.5645 

-.6053 

XI 1 (NORMT) 

.6645 

. 5532 

.5143 

.6696 

-.6152 

X12(lvGLT) 

.6700 

.5468 

. 5145 

.6797 

-.6168 

Xl6  (MAXTIT) 

.4969 

. 3722 

— 

. 2812 

-. 4261 

X17  (NORMSFC) 

.6455 

. 5366 

.5004 

.6592 

-. 5932 

X18  (MILSFC) 

.6506 

. 5448 

.5035 

.6612 

-.6014 
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\ COST  with 
Control!  in  gV 


for 

X6 

X8 

X16 

X23 

X25 

X20  (CRFM) 

.7093 

.5247 

. 5630 

. 5926 

..6285 

X21  (MRPM) 

.7094 

.5243 

.5629 

.5920 

.. 6285 

X23  (EPR) 

.4235 

.4359 

. 1464 

w w • 

-.4849 

X24  (AIR) 

.6883 

.5558 

. 5591 

.7321 

-.6269 

X25  (AFHBO) 

.4664 

.2595 

.2351 

.4113 

— 

X26  (FH) 

.6463 

.5118 

. 5307 

. 5750 

-.  5948 

X27  (AFH) 

.6463 

.5118 

.5307 

.5750 

-.5948 

APPENDIX  C-6 


ALL  ENGINES 
N » 22 


[ 

t 

E 

! Representative  combinations  of  independent  variables  which 


may  produce  cost  estimating  relationships  which  are  least  affected 


by  multicoUinearity  between  the  independent  variables. 


I 

f 

I Combination  #1 

1 


Combination  #2 


XI  Number  of  Compressor  Rotors  | 

X4  Number  of  Turbine  Blades 

X5  Number  of  Burner  Cans 

X7  Engine  Diameter 

X8  Length  to  Diameter  Ratio 

X20  Cruise  RPM 

X4  Number  of  Turbine  Blades 

X5  Number  of  Burner  Cans 

X6  Engine  Length 

X7  Engine  Diameter 

X8  Length  to  Diameter  Ratio 

X9  Engine  Dry  Weight 

XIO  Thrust  to  Weight  Ratio 

X20  Cruise  RPM 
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APPENDIX  C-7 


TURBOJETS 
N » 15 

Bivariate  Correlation  with  COST 


r„  > + . 5000 
o ~ — 


X3  (TR) 


5168 


X25  (AFHBO)  1.0000 


COST  with 


APPENDIX  C-10 


TURBOFANS 
N = 7 

Independent  Variables  with  Tq  > ^,7000 
Bivariate  Correlation  with  COST 


Name 

Number 

^o 

Number  of  Turbine  Blades 

X4 

8998 

Engine  Length 

X6 

.7398 

Engine  Diameter 

X7 

8054 

Length  to  Diameter  Ratio 

X8 

.7403 

Thrust  to  Weight  Ratio 

XIO 

.8197 

Normal  Specific  Fuel  Consumption 

X17 

.7808 

Criiise  RPM 

X20 

.7599 

Maximum  RPM 

X21 

.7479 

Fan  By-pass  Ratio 

• X22 

.7071 

Engine  Pressure  Ratio 

X23 

.8254 

Maximum  Rated  Airflow 

X24 

.7137 

100 


Controlling 


APPENDIX  C-13 


X4  (TB)  -0.1623  0.  17  0.90 


APPENDIX  C-14 


1 


i 

3 

I; 

ALL  ENGINES  f 

Average  Percent  Predictability  of  Model  Using  Test  Engines  I 

Actual  Predicted  % ;j 

Engine Cost* Cost Predictability  | 


J57-P-23 

$20.78 

$488. 68 

2251.  0 

J60-P-3 

11.94 

408. 84 

2237.0 

J75-P-17 

34.36 

444.77 

1194.0 

J75-P-19 

9.61 

461.91 

4706.0 

TF33-P-9 

4.61 

554. 29 

11923.0 

♦ 

COST  = Average  core- 

engine  overhaul  cost/flight  hour 

Average  percent  predictability  = 4461.0 


2477  5.49  0.00 


APPENDIX  C-17 


TURBOFANS 

Average  Percent  Predictability  of  Model  Using  Teat  Engines 


Actual 

Predic  ted 

Percent 

Engine 

Cost 

Cost 

Predictability 

TF-30-P-7 

$30.64 

$18.02 

41.  0 

TF-39-GE-1 

132.41 

660. 86 

399.0 

Average  percent  predictability  = 220.  0 


APPENDIX  C-19 


[ 

t 

I 

I 

I 

I 

I 

i 

! 

I 

; 

I 

i 
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TURBOFANS  (with  afterburner) 

Average  Percent  Predictability  of  Model  Using  Test  Engines 


Engine 


Actual  Predicted  Percent 

Cost Cost Predictability 


TF30-P-100  $77.55  $133.39  72.0 


Average  percent  predictability  = 72.  0 
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APPENDIX  C-21 


TURBOFANS  (without  afterburner) 

Average  Percent  Predictability  of  Model  Using  Test  Engines 


Engine 


Actual 

Cost 


P redic  ted 
Cost 


Percent 

Predictability 


TF39-GE-i 


$132.41 


$49. 85 


62.  35 


Average  percent  predictability  = 62.  35 


APPENDIX  C-23 


TURBOJETS  (without  aifterburner) 

Average  Percent  Predictability  of  Model  Using  Test  Engines 


I 


Ensine 


Actual 

Cost 


Predicted 
Cost 


Percent 

Predictabilit 


J57-P-21  $54.64  $40.79  25.0 

J85-GE-13  4.76  30.71  545.0 

Average  percent  predictability  = 285.0 
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APPENDIX  C-24 


Independent  Variables 

Average  Percent  Predictability 

Number  of  engine  cases  us^d  to  develop  the  CER 


APPENDIX  C-25 


ALL  ENGINES 
N » 22 


Initial  Independent  Variables  Considered  for 
Furtber  CER  Development 


Name 

Number 

Abbreviation 

Number  of  Compressor  Rotors 

XI 

CR 

Number  of  Con^ressor  Blades 

X2 

CB 

Number  of  Turbine  Blades 

X4 

TB 

Number  of  Burner  Cans 

X5 

BC 

Engine  Length 

X6 

L 

Engine  Diameter 

X7 

D 

Length  to  Diameter  Ratio 

X8 

LD 

Engine  Dry  Weight 

X9 

W 

Thrust  to  Weight  Ratio 

XIO 

TW 

Rated  Thrust  at  Military  Power 

X12 

MILT 

Rated  Turbine  Inlet  Temperature 
at  Maximum  Power 

X16 

MAXTIT 

Normal  Specific  Fuel  Consumption 

X17 

NORMSFC 

Military  Specific  Fuel  Consumption 

X18 

MILSFC 

Engine  Pressure  Ratio 

X23 

EPR 

♦ 

t 
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Name 


Number  Abbreviation 


Maximum  Rated  Airflow 

XZ4 

AIR 

Average  Flight  Hours  Between 

Overhaul 

X25 

AFHBO 

Total  Flight  Hours  (FY  1974, 

1975,  1976) 

X26 

FH 
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APPENDIX  C-26 


ALL  ENGINES 
N = 22 

Final  Independent  Variables  Considered  for 
Further  CER  Development 


Name 

Number 

Abbreviation 

Engine  Length 

X6 

L 

Engine  Dry  Weight 

X9 

W 

Thrust  to  Weight  Ratio 

XIO 

TW 

Rated  Turbine  Inlet  Temperature 
at  Maximum  Power 

X16 

MAXTIT 

Normal  Specific  Fuel  Consumption 

X17 

NORMSFC 

Military  Specific  Fuel  Consumption 

X18 

MILSFC  . 

Maximum  Rated  Airflow 

X24 

AIR 

Average  Flight  Hours  Between 

Overhaul 

X25  ' 

AFHBO 
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APPENDIX  C-28 

TURBOJETS 

N = 19  , 

Initial  Independent  Variables  Considered  for 
Further  CER  Development 


Name 

Number 

Abbreviation 

Number  of  Turbine  Blades 

X4 

TB 

Number  of  Burner  Cans 

X5 

BC 

Engine  Length 

X6 

L 

Engine  Diameter 

X7 

D 

Length  to  Diameter  Ratio 

X8 

LD 

Engine  Dry  Weight 

X9 

W 

Thrust  to  Weight  Ratio 

xio 

TW 

Rated  Thrust  at  Military  Power 

X12 

MILT 

Rated  Turbine  Inlet  Temperature 
at  Maximum  Power 

X16 

MAX  TIT 

Maximum  RPM 

X21 

MRPM 

Engine  Pressure  Ratio 

X23 

EPR 

Maximum  Rated  Airflow 

X24 

AIR 

Average  Flight  Hours  Between  Overhaul 

X25 

AFHBO 

Total  Flight  Hours  (FY  1974,  1975,  1976) 

X26 

FH 
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APPENDIX  C-29 

TURBOJETS 
N = 19 


Final  Independent  Variables  Considered  for 
Further  CER  Development 


Name 

Number 

Abbreviation 

Number  of  Turbine  Blades 

X4 

TB 

Number  of  Burner  Cans 

X5 

BC 

Engine  Length 

X6 

L 

Length  to  Diameter  Ratio 

X8 

LD 

Engine  Dry  Weight 

X9 

W 

Thrust  to  Weight  Ratio 

XIO 

TW 

Rated  Thrust  at  Military  Power 

X12 

MILT 

Rated  Turbine  Inlet  Temperature 
at  Maximum  Power 

X16 

MAX  TIT 

Engine  Pressure  Ratio 

X23 

EPR 

Maximum  Rated  Airflow 

X24 

AIR 

Average  Flight  Hours  Between  Overhaul 

X25 

AFHBO 

1 
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APPENDIX  C-30  I 

] 

1 

TURBOJETS 
N = 19 

Comparison  of  R^,  percent  prediction,  and  percent  efficiency  for 
varying  degrees  of  freedom*  One  thousand  random  selections  of  90^ 


of  the  sample  (N 

= 19). 

R^ 

% Pred. 

% Eff. 

D.F.  =4 

Mean 

.6928 

288. 99 

93.32 

Range 

1333  - .9988 

2.  18  - 67608. 20 

0 - 100 

Std  Dev. 

. 1668 

2229.94 

21.32 

D.F.  =7 
Mean 
Range 
Std  Dev. 


.5922  191.21 

.3925  - .9975  1.46  - 5223.63 

.1448  512.69 
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92.74 
0 - 100 
21.23 


A 


APPENDIX  C-31 


TURBOFANS 
N = 9 

Initial  Independent  Variables  Considered  for 
Further  CER  Development 


Name 

Number 

Abbreviation 

Number  of  Compressor  Rotors 

XI 

CR 

Number  of  Compressor  Blades 

X2 

CB 

Number  of  Turbine  Blades 

X4 

TB 

Thrust  to  Weight  Ratio 

XIO 

TW 

Engine  Pressure  Ratio 

X23 

EPR 

APPENDIX  C-32 

TURBO FANS 


Final  Independent  Variables  Considered  for 

Further  CER  Development 

Name 

Number 

Abbreviation 

Number  of  Compressor  Rotors 

XI 

CR 

Number  of  Turbine  Blades 

X4 

TB 

Thrust  to  Weight  Ratio 

XIO 

TW 

125 

L j 


Variable  Coefficient  Std  Error  F-Statistic  Significant  (95%) 


Final  Coat  Eatimating  Relationahip 


Variable  Coefficient  Std  Error  F -Statistic  Significant  (95%) 


APPENDIX  D 


MPEG 


To  perform  a more  thorough  amalysis  of  the  data,  the  com- 
puter program,  MPEG,  pages  136  through  140,  was  developed  based 
on  programs  utilized  in  a previous  thesis  effort  (9).  The  program, 
MPEG,  is  structured  so  that  the  user  may  choose  to  rxin  any  number 
of  iterations  desired. 

MPEG  requires  the  following  inputs: 

JPOWS  = the  number  of  data  cases  in  the  data  sample 
matrix 

JCOLS  s the  number  of  columns  in  the  data  sample 
matrix 

T1  s t -value  for  constructing  confidence  intervals 
when  the  test  engines  are  considered  indi- 
vidually 

T2  s t-value  for  constructing  confidence  intervals 
when  the  test  engines  are  considered  simul- 
taneously 

SAMPLE  3 the  percent  of  the  data  samples  to  be  used 
for  model  building,  e.g.,  .80. 

ITIMES  3 the  number  of  times  the  program  is  to  be 


JCOL 


3 the  column  index  in  the  data  matrix  of  the 
dependent  variable 

3 the  number  of  independent  variables  to  be 
used  in  building  the  regression  model 


I 


IND(J)  = the  individual  column  indices  in  the  data  matrix 
for  each  of  the  independent  variables  to  be  used 

The  MREG  program  randomly  splits  the  data  file  into  a per- 
centage, as  specified  by  the  value  input  for  SAMPLE,  for  building  a 

model  and  the  remainder  for  testing  the  model.  Then  MHEG  computes  | 

! 

the  (X'^X)"^  or  "C"  matrix  , develops  the  model,  and  computes  a pre- 
dicted cost  for  each  of  the  test  engines.  MB.EG  computes  confidence 
interval  half-widths  for  each  predicted  cost  and  computes  a confidence 
interval  around  the  predicted  value.  The  confidence  interval  half- 
widths and  the  confidence  intervals  are  computed  for  the  desired  level 
of  confidence,  e.g.,  95  percent,  as  specified  by  the  t-values  input  to 
the  program.  The  t-values  depend  on  the  difference  between  the 
sample  size  (N)  and  one  more  than  the  number  of  independent  variables 
in  the  equation  (P).  As  the  difference  gets  larger,  the  degrees  of 
freedom  increase  and  the  confidence  interval  narrows. 

The  predicted  cost  for  each  test  engine  is  compared  with  the 
actual  cost  for  the  test  engine  and  the  difference  between  them  is 
computed.  The  difference  is  divided  by  the  predicted  cost  and  tht 
actxial  cost  to  determine  its  percentage  of  each.  The  percentage  of 
predicted  cost  is  used  as  the  indicator  of  predictive  capability  of  the 
model  for  each  test  engine.  The  average  predictive  capability  for 
the  model  considering  all  test  engines  is  computed  by  summing  the 
individual  percentages  and  dividing  by  the  number  of  test  engines 
utilized.  This  value  is  retained  in  a data  array  for  each  iteration 
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Once  the  confidence  intervals  are  developed  for  the  test 
engines,  the  actual  cost  for  each  engine  is  compared  with  the  confi- 
dence interval  limits.  If  the  actual  cost  falls  within  the  confidence 
interval,  an  efficiency  counter  is  incremented.  When  all  test  engine 
actual  costs  have  been  compared,  the  value  of  the  efficiency  counter 
is  divided  by  the  number  of  test  engines  considered  to  determine  the 
predictive  efficiency  of  the  model  considering  all  test  engines.  The 
value  5s  also  retained  in  a data  array  for  each  iteration  of  the  pro- 
gram. If,  for  example,  the  confidence  interval  is  constructed  on  the 
basis  of  a 95  percent  confidence  level,  it  can  be  expected  that  in  the 
long  run,  possibly  after  1000  iterations,  an  acceptable  model  will  have 
an  average  predictive  efficiency  of  at  least  95  percent.  If  the  degrees 
of  freedom  are  increased  either  by  increasing  the  sample  size  or 
decreasing  the  number  of  variables  in  the  model,  the  confidence 
interval  will  then  narrow  and  the  model  may  or  may  not  be  able  to 
maintain  a 95  percent  efficiency. 
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*#RON  *-(OLIB)GRADLIB/TSS,R 
PARAMETER  Pl-33 , P2-30 , P3-P2/2 , P4-2*P1 
DIMENSION  CONF(Pl,l) ,EE(l,Pl) ,BUILD(P2,Pl) ,TEST{P2,Pl) 
DIMENSION  COSTP(l,P2) ,COSTD(l,P2) ,COSTPCT (1 , P2) 
4COSTPCP(l,P2) 

DIMENSION  X{P2,P1) ,XT(P1,P2) ,Y(P2,1) ,YT(1,P2) 

DIMENSION  HOLD(Pl,P4) ,G(Pl,l) ,B(Pl,l) ,BT(l,Pl) 

DIMENSION  ST(P2,P1) ,IND(Pl) ,IDOM(Pl) ,CON(l,Pl) 

DIMENSION  PARRAY(100,20) , EARRAY (100 , 20) , R2ARRAY ( 100 , 20) 
INTEGER  POINTR{P2) 
f REAL  MSE,MSR 

t READ (5,101) JROWS , JCOLS , T1 , T2 , SAMPLE , ITIMES 

READ(10,101) ( (ST(J,K) ,K«1, JCOLS) ,J*1, JROWS) 

ICOL  « 0 
IROW  » 1 
NTSTCTR  » 0 
NSOLCTR  » 0 
NEGVAL  » 0 
NODEGFRE  = 0 
DO  999  IM=1, ITIMES 
JB=1 
JTB-1 

DO  20  J-1,  JROWS 
101  FORMAT (V) 

IF  (RND(-1.0) .LE. SAMPLE)  GO  TO  9 
POINTR(JTB)»J 

DO  12  L*l, JCOLS 
12  TEST(JTB,L)=ST(J,L) 

JTB-JTB+1 
GO  TO  20 

9 DO  11  1=1, JCOLS 

11  BUILD(JB,I)=ST(J,I) 

JB»JB+1 
20  CONTINUE 

JTB=JTB-1 
JB-JB-1 
JROW=JB 

IF(JTB.EQ.0)  GO  TO  996 

READ(11,101,END=978) JY,JCOL, (IND(J) ,J  * l,JCOL) 
i 978  REWIND  11 

n . DO  26  I»l,JROW 

f Y(I,1)»BUILD(I,JY) 

t X(I,1)-1.0 

[ DO  26  J»l,JCOL 

> JJ-IND{J) 

i 26  X(I,J+l)»BOILD(I,JJ) 
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JCOWCOL+1 
DO  30  K«1,JR0W 
DO  30  J»l,JCOL 
30  XT(J,K)  » X(K,J) 

XX  « 0 

DO  70  J-1,  JCOL 
DO  60  L-1,  JCOL 
DO  50  K-1,  JROW 
50  XX  » XX  + XT(J,K)  * X(K,L) 

HOLD(J,L)  « XX 
XX  « 0 

60  CONTINUE 
70  CONTINUE 

K • JCOL  + 1 
L '•  JCOL  * 2 
N - K - 1 
DO  100  M-1,  JCOL 
DO  90  J-K,  L 
90  HOLD(M,J)  « 0 
N » N + 1 
100  HOLD(M,N}  > 1 

DO  160  K»l,  JCOL 
XX  - HOLD(K,K) 

IP  (XX)  110,  998,  110 
110  DO  120  J»1,L 
120  HOLD(K,J)  - HOLD(K,J)  / XX 
DO  150  J=l,  JCOL 
IP  (J  - K)  130,  150,  130 
130  XXX  = HOLD(J,K) 

DO  140  M=l,  L 

140  HOLD(J,M)  » HOLD(J,M)  - XXX  * HOLD(K,M) 
150  CONTINUE 
160  CONTINUE 

KK  - JCOL  + 1 
XX  « 0 

DO  190  J»l,  JCOL 
DO  180  K=l,  JROW 
180  XX  - XX  + XT(J,K)  * Y(K,1) 

G(J,1)  » XX 
XX  « 0 

190  CONTINUE 

L » JCOL  + 1 
LL  - JCOL  * 2 
DO  210  J-1,  JCOL 
M » 1 

DO  200  R>L,  LL 

XX  - XX  + G(M,1)  * eOLD(J,K) 

200  M « M + 1 
B(J,1)  - XX 
XX  « 0 

210  CONTINUE 

DO  220  J»l,JROW 
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220  YT(1,J)»Y(J,1) 

DO  230  J-l,JROW 
230  XX»XX+YT(1,J) *Y{J,1) 

YY-XX 

XX»0 

DO  240  J>l,JCOL 
240  BT(1,J)-B(J,1) 

DO  250  J«l,JCOL 
250  XX»XX+BT(1,J)*G(J,1) 

ZZ-0 

DO  255  J»l,JROW 
255  ZZ»ZZ+Y(J,1) 

R-JROW 

ZZ-{ZZ*ZZ)/R 

SST-YY-ZZ 

SSR»XX-ZZ 

SSE-YY-XX 

RR»(XX-ZZ)/(YY-ZZ) 

IR-JCOL-1 

IX»JROW-JCOL 

IF(IX.LT.l)GO  TO  969 

S2-SSE/IX 

MSR-SSR/IR 

MSE-SSE/IX 

P-MSR/MSE 

IDOM(1)»0 

DO  264  I»l,JCOL 

264  IDUM(I+1)»IND(I) 

N»0 

DO  265  K=L,LL 
N»N+1 

IF(S2.LT. (0.0) >GO  TO  975 
IF(HOLD.(N,K)  .LT.  (0.0)  )GO  TO  976 
XX«SQRT(S2)*SQRT(HOLD(N,K) ) 

265  CONTINUE 
JROW-JB+JTB 

SVAR»0.0 
PREDICT  » 0.0 

DO  88  I»l,JROW 
COSTP(I,l)-B(l,l) 

DO  89  L»2,JCOL 
VAR«ST(I,IND(L-1) )*B(L,1) 

£VAR»SVAR+VAR 
89  CONTINUE 

COSTP (1,1) -COSTP (1,1) +SVAR 
SVAR-0.0 

COSTD(I,l)-COSTP(I,l)-ST(I,JY) 
COSTPCP(I,l)»(COSTD(I,l)/COSTP(I,l) ) *100 
COSTPCT ( 1 , 1 ) - ( COSTD (1,1) /ST ( I , J Y ) ) * 1 0 0 
88  CONTINUE 
ECTR-0.0 

DO  4 I-1,JTB 
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VAL-0.0 

DO  5 J-1,JC0L 

5 BE(1,J)«0.0 
CON{1,1)«1.0 

DO  6 J»l,JCOL-l 
C0N(1,J+1)*TEST(I,1ND(J) ) 

6 CONTINUE 

DO  800  II-l,JCOL 
CONP (11,1) -CON (1,11) 

800  CONTINUE 

DO  620  12-1, JCOL 
DO  630  13-1, JCOL 

EE(1,I2)»EE(1,I2)+C0N(1,I3)*H0LD(I3,I2+JC0L) 

630  CONTINUE 
620  CONTINUE 

DO  660  13-1, JCOL 
VAL»VAL+EE(1,I3) *CONF(I3,l) 

660  CONTINUE 

VALU-VAL+1.0 
CI-T1*SQRT (VALU*MSE) 

CIA»T2*SQRT (VALU*MSE) 

IN-POINTR(I) 

PREDICT  » PREDICT  + ABS (COSTPCP (IN, 1) ) 

RANGEl-COSTP ( IN , 1 ) +CI 
RANGE  2-COSTP ( IN , 1 ) -C I 
RAGl-COSTP ( IN , 1 ) +CIA 
RAG2-COSTP ( IN , 1 ) -CIA 

IF( (RAG1-TEST(I,JY) ) .LT. (0.0) .OR. (RAG2-TEST(I, JY) ) 
&.GT. (0.0) ) GO  TO  4 
ECTR  » ECTR  +1.0 
4 CONTINUE 
TB-JTB 

ICOL  - ICOL  + 1 

PARRAY(IROW,ICOL)  - PREDICT/TB 
EARRAY(IROW,ICOL)  » ECTR/TB 
R2ARRAY(IROW,ICOL)  » RR 
IP( (ICOL-20) .NE.0)GO  TO  999 
ICOL  - 0 
IROW  - IROW  + 1 

995  GO  TO  999 

976  PRINT, "NEGATIVE  VALUES  IN  TBE  INVERSE  MATRIX" 

975  NEGVAL  » NEGVAL  + 1 
PRINT  977 
GO  TO  999 

996  NTSTCTR  « NTSTCTR  + 1 

997  GO  TO  999 

969  NODEGPRE  - NODEGFRE  + 1 
GO  TO  999 

998  NSOLCTR  - NSOLCTR  + 1 

999  CONTINUE 

992  PRINT  980, NTSTCTR, NSOLCTR, NEGVAL, NODEGFRE, ITIMES 
ICOLS  - 20 
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PRINT  979 

991  PRINT  101, ( (EARRAY(III,JJJ) ,JJJ-1,IC0LS) ,III-1,IR0W) 
PRINT  979 

990  PRINT  101, ( (PARRAY(III,JJJ) ,JJJ-1,IC0LS) ,III»1,IR0W) 
PRINT  979 

PRINT  101, ( (R2ARRAYfIII,JJJ) , JJJ»1, ICOLS) ,III-l,IROW) 

994  WRITE (12,101) ( (EARRAY (III, JJJ) , JJJ»1, ICOLS) , III-l , IROW) 
993  WRITE(13,101) ( (PARRAY (III, JJJ) ,JJJ-1, ICOLS) ,III»1, IROW) 
WRITE (14,101) ( (R2ARRAY(III,JJJ) ,JJJ-1 , ICOLS) ,III»l,IROW) 
STOP 

980  FORMAT (10X,10HNTSTCTR  » , 14 , 5X, 10HNSOLCTR  « ,I4,5X, 
&10HNEGVAL  » ,I4//5X,11HN0DEGFRE  =>  ,I4,9HITIMES  * ,14) 

977  FORMAT (/) 

979  FORMAT (////) 

END 
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